WORLD INTELLECTUAL PROPERTY ORGANIZATION

International Bureau

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 : 
C12N 15/00, C12P 19/34 



(11) International Publication Number: WO 90/14424 

(43) International Publication Date: 29 November 1990 (29.11.90)



(21) International Application Number: PCT/US90/02835

(22) International Filing Date: 16 May 1990 (16.05.90) 



(30) Priority data:

    352,927    16 May 1989 (16.05.89)          US
    353,235    16 May 1989 (16.05.89)          US
    352,884    17 May 1989 (17.05.89)          US
    353,241    17 May 1989 (17.05.89)          US
    410,716    20 September 1989 (20.09.89)    US
    410,735    20 September 1989 (20.09.89)    US
    410,749    20 September 1989 (20.09.89)    US
    410,888    20 September 1989 (20.09.89)    US
    411,058    21 September 1989 (21.09.89)    US
    446,333    4 December 1989 (04.12.89)      US
    497,106    20 March 1990 (20.03.90)        US



(71) Applicant: SCRIPPS CLINIC AND RESEARCH FOUNDATION
[US/US]; 10666 North Torrey Pines Road, La
Jolla, CA 92037 (US).



; 471 Avenida Primavera, 



(74) Agents: BINGHAM, Douglas, A.; Dressler, Goldsmith,
Shore, Sutker & Milnamow, Ltd., 11300 Sorrento Valley
Road, #200, San Diego, CA 92121 (US) et al.



(81) Designated States: AT (European patent), AU, BE (European
patent), CH (European patent), DE (European
patent)*, DK (European patent), ES (European patent),
FR (European patent), GB (European patent), IT (European
patent), JP, LU (European patent), NL (European
patent), SE (European patent).



With international search report.
Before the expiration of the time limit for amending the
claims and to be republished in the event of the receipt
of amendments.



(54) Title: METHOD FOR ISOLATING RECEPTORS HAVING A PRESELECTED SPECIFICITY 



(57) Abstract 

The present invention relates to a method for isolating from a repertoire of genes containing a conserved nucleotide se- 
quence a gene coding for a receptor having a preselected activity. Receptors produced by the gene isolated by the method, particu- 
larly catalytic receptors, are also contemplated. 



* See back of page 



DESIGNATIONS OF "DE"



Until further notice, any designation of "DE" in any international application 
whose international filing date is prior to October 3, 1990, shall have effect in the 
territory of the Federal Republic of Germany with the exception of the territory of the 
former German Democratic Republic. 





FOR THE PURPOSES OF INFORMATION ONLY

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international
applications under the PCT.

AT  Austria                        ES  Spain                                    MC  Monaco
AU  Australia                      FI  Finland                                  MG  Madagascar
BB  Barbados                       FR  France                                   ML  Mali
BE  Belgium                        GA  Gabon                                    MR  Mauritania
BF  Burkina Faso                   GB  United Kingdom                           MW  Malawi
BG  Bulgaria                       GR  Greece                                   NL  Netherlands
BJ  Benin                          HU  Hungary                                  NO  Norway
BR  Brazil                         IT  Italy                                    RO  Romania
CA  Canada                         JP  Japan                                    SD  Sudan
CF  Central African Republic       KP  Democratic People's Republic of Korea    SE  Sweden
CG  Congo
CH  Switzerland                    KR  Republic of Korea                        SU  Soviet Union
CM  Cameroon                       LI  Liechtenstein                            TD  Chad
DE  Germany, Federal Republic of   LK  Sri Lanka                                TG  Togo
DK  Denmark                        LU  Luxembourg                               US  United States of America



WO 90/14424 






PCT/US90/02835 



METHOD FOR ISOLATING RECEPTORS HAVING A PRESELECTED SPECIFICITY 
Description 

Technical Field

The present invention relates to a method 
for producing polymers having a preselected 
activity. 

Background 

Binding phenomena between ligands and 
receptors play many crucial roles in biological 
systems. Exemplary of such phenomena are the 
binding of oxygen molecules to deoxyhemoglobin to 
form oxyhemoglobin, and the binding of a substrate 
to an enzyme that acts upon it such as between a 
protein and a protease like trypsin. Still further 
examples of biological binding phenomena include 
the binding of an antigen to an antibody, and the 
binding of complement component C3 to the so-called 
CR1 receptor. 

Many drugs and other therapeutic agents 
are also believed to be dependent upon binding 
phenomena. For example, opiates such as morphine 
are reported to bind to specific receptors in the
brain. Opiate agonists and antagonists are 
reported to compete with drugs like morphine for 
those binding sites. 

Ligands such as man-made drugs, like 
morphine and its derivatives, and those that are 
naturally present in biological systems such as 
endorphins and hormones bind to receptors that are 
naturally present in biological systems, and will
be treated together herein. Such binding can lead 
to a number of the phenomena of biology, including 



particularly the hydrolysis of amide and ester 
bonds as where proteins are hydrolyzed into 
constituent polypeptides by an enzyme such as 
trypsin or papain or where a fat is cleaved into 
glycerine and three carboxylic acids, respectively. 
In addition, such binding can lead to formation of 
amide and ester bonds in the formation of proteins 
and fats, as well as to the formation of carbon to 
carbon bonds and carbon to nitrogen bonds. 

An exemplary receptor-producing system in 
vertebrates is the immune system. The immune 
system of a mammal is one of the most versatile 
biological systems as probably greater than 1.0 x
10^7 receptor specificities, in the form of
antibodies, can be produced. Indeed, much of 
contemporary biological and medical research is 
directed toward tapping this repertoire. During 
the last decade there has been a dramatic increase 
in the ability to harness the output of the vast 
immunological repertoire. The development of the 
hybridoma methodology by Kohler and Milstein has 
made it possible to produce monoclonal antibodies,
i.e., a composition of antibody molecules of a
single specificity, from the repertoire of 
antibodies induced during an immune response. 

Unfortunately, current methods for 
generating monoclonal antibodies are not capable of 
efficiently surveying the entire antibody response 
induced by a particular immunogen. In an 
individual animal there are at least 5-10,000
different B-cell clones capable of generating
unique antibodies to a small, relatively rigid
immunogen, such as, for example, dinitrophenol.
Further, because of the process of somatic mutation
during the generation of antibody diversity,
essentially an unlimited number of unique antibody
molecules may be generated. In contrast to this
vast potential for different antibodies, current
hybridoma methodologies typically yield only a few
hundred different monoclonal antibodies per fusion.

Other difficulties in producing
monoclonal antibodies with the hybridoma
methodology include genetic instability and low
production capacity of hybridoma cultures. One
means by which the art has attempted to overcome
these latter two problems has been to clone the
immunoglobulin-producing genes from a particular
hybridoma of interest into a procaryotic expression
system. See, for example, Robinson et al., PCT
Publication No. WO 89/0099; Winter et al., European
Patent Publication No. 0239400; Reading, U.S.
Patent No. 4,714,681; and Cabilly et al., European
Patent Publication No. 0125023.

The immunologic repertoire of vertebrates
has recently been found to contain genes coding for
immunoglobulins having catalytic activity.
Tramontano et al., Sci., 234:1566-1570 (1986);
Pollack et al., Sci., 234:1570-1573 (1986); Janda
et al., Sci., 241:1188-1191 (1988); and Janda et
al., Sci., 244:437-440 (1989). The presence of, or
the ability to induce the repertoire to produce,
antibody molecules capable of catalyzing a
chemical reaction, i.e., acting like enzymes, had
previously been postulated almost 20 years ago by
W. P. Jencks in Catalysis in Chemistry and
Enzymology, McGraw-Hill, N.Y. (1969).

It is believed that one reason the art
failed to isolate catalytic antibodies from the
immunological repertoire earlier, and has failed
to isolate many to date even after their actual
discovery, is the inability to screen a large
portion of the repertoire for the desired activity.
Another reason is believed to be the bias of
currently available screening techniques, such as
the hybridoma technique, towards the production of
high affinity antibodies inherently designed for
participation in the process of neutralization, as
opposed to catalysis.

Brief Summary of the Invention 

The present invention provides a method
for producing a linear double stranded DNA
expression vector containing a 5' terminal promoter
controlling the expression of first and second
polypeptides from respective first and second
genes, said first and second polypeptides being
capable of forming a heterodimeric receptor of
predetermined specificity.

The vector is prepared by isolating a
first diverse population of first genes containing
a gene coding for said first polypeptide, said
first population being constituted by linear double
stranded DNA molecules defining a restriction site
recognized by an endonuclease, said restriction
site located 5' terminal to said genes but 3'
terminal to a promoter controlling the expression
of said genes. A second diverse population of
second genes containing a gene coding for said
second polypeptide is then isolated, said second
population being constituted by linear double
stranded DNA molecules defining said restriction
site, said restriction site located 3' terminal to
said second genes, said second genes located 3'
terminal to a promoter controlling the expression
of said second genes. The double stranded DNA
molecules are then cleaved while present in said
populations with said endonuclease to produce
restriction fragments having a cohesive terminus
and either one of said first genes or one of said
second genes. The restriction fragments so
produced are randomly ligated one to another via
said cohesive terminus of each to produce a diverse
population of double stranded linear DNA expression
vectors having one of said first genes and one of
said second genes in tandem on the same DNA strand
and under the control of a single promoter. A
vector capable of expressing said heterodimeric
receptor of predetermined specificity is then
isolated from the diverse population of double
stranded DNA expression vectors.
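
The random pairing step summarized above can be illustrated with a small simulation. This is only a minimal sketch under stated assumptions, not the disclosed laboratory procedure: the gene labels, the population sizes, and the helper name `ligate_randomly` are hypothetical, and ligation is idealized as a uniform random choice of one first-gene fragment and one second-gene fragment per vector.

```python
import random
from itertools import product


def ligate_randomly(first_genes, second_genes, n_vectors, seed=0):
    """Idealized random ligation (hypothetical helper).

    Each resulting vector carries exactly one first gene and one second
    gene, chosen independently and uniformly at random.
    """
    rng = random.Random(seed)
    return [
        (rng.choice(first_genes), rng.choice(second_genes))
        for _ in range(n_vectors)
    ]


# Hypothetical toy populations standing in for the two diverse gene pools.
first_genes = [f"first_{i}" for i in range(1, 4)]    # 3 first genes
second_genes = [f"second_{i}" for i in range(1, 5)]  # 4 second genes

# Every first/second pairing that random ligation could in principle produce.
possible = set(product(first_genes, second_genes))

# A simulated library of 200 ligated vectors and the distinct pairs it holds.
library = ligate_randomly(first_genes, second_genes, n_vectors=200)
observed = set(library)

print("possible pairings:", len(possible))
print("distinct pairings observed:", len(observed))
```

The point of the sketch is the multiplicative growth: the number of possible pairings is the product of the two population sizes, which is why the resulting vector population is diverse and must then be screened for the vector of interest.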



Brief Description of the Drawings 

In the drawings forming a portion of this
disclosure:

Figure 1 illustrates a schematic
diagram of the immunoglobulin molecule showing the
principal structural features. The circled area on
the heavy chain represents the variable region (VH).
A polypeptide containing a biologically active
(ligand binding) portion of that region, and a gene
coding for that polypeptide, are produced by the
methods of the present invention.

Figure 2A Diagrammatic sketch of an H
chain of human IgG (IgG1 subclass). Numbering is
from the N-terminus on the left to the C-terminus
on the right. Note the presence of four domains,
each containing an intrachain disulfide bond (S-S)
spanning approximately 60 amino acid residues. The
symbol CHO stands for carbohydrate. The V region
of the heavy (H) chain (VH) resembles VL in having
three hypervariable CDRs (not shown).

Figure 2B Diagrammatic sketch of a human kappa chain
(Panel 1). Numbering is from the N-terminus on the
left to the C-terminus on the right. Note
the intrachain disulfide bond (S-S) spanning about
the same number of amino acid residues in the VL and
CL domains. Panel 2 shows the locations of the
complementarity-determining regions (CDR) in the VL
domain. Segments outside the CDR are the framework
segments (FR).

Figure 3 Amino acid sequence of the VH regions
of 19 mouse monoclonal antibodies with specificity 
for phosphorylcholine. Each segment of the amino 
acid sequence is shown in consecutively numbered 
sheets. The designation HP indicates that the 
protein is the product of a hybridoma. The 
remainder are myeloma proteins. (From Gearhart et 
al., Nature, 291:29, 1981.) 

Figure 4 Illustrates the results obtained from
PCR amplification of mRNA obtained from the spleen
of a mouse immunized with FITC. Lanes R17-R24
correspond to amplification reactions with the
unique 5' primers (2-9, Table 1) and the 3' primer
(12, Table 1). R16 represents the PCR reaction with
the 5' primer containing inosine (10, Table 1) and
3' primer (12, Table 1). Z and R9 are the
amplification controls; control Z involves the
amplification of VH from a plasmid (PLR2) and R9
represents the amplification from the constant
regions of spleen mRNA using primers 11 and 13
(Table 1).

Figure 5 Nucleotide sequences of clones from
the cDNA library of the PCR amplified VH regions in
Lambda ZAP are shown in consecutively numbered
sheets. The N-terminal 110 bases are listed here
and the underlined nucleotides represent CDR1
(complementarity determining region).
Sequences L03, L35, L47 and L48 could not be
classified into any predefined subgroups.

Figure 6 The sequence of the synthetic DNA
insert inserted into Lambda ZAP to produce Lambda
Zap II VH (Panel A) and Lambda Zap VL (Panel B)
expression vectors (each panel shown in
consecutively numbered sheets). The various
features required for this vector to express the VH
and VL-coding DNA homologs include the Shine-Dalgarno
ribosome binding site, a leader sequence
to direct the expressed protein to the periplasm as
described by Mouva et al., J. Biol. Chem., 255:27,
1980, and various restriction enzyme sites used to
operatively link the VH and VL homologs to the
expression vector. The VH expression vector
sequence also contains a short nucleic acid
sequence that codes for amino acids typically found
in variable regions of the heavy chain (VH Backbone). This
VH Backbone is just upstream of, and in the proper
reading frame with, the VH DNA homologs that are operatively
linked into the Xho I and Spe I sites. The VL DNA
homologs are operatively linked into the VL sequence
(Panel B) at the Nco I and Spe I restriction enzyme
sites and thus the VH Backbone region is deleted
when the VL DNA homologs are operatively linked into
the VL vector.

Figure 7 The major features of the bacterial
expression vector Lambda Zap II VH (VH expression
vector) are shown. The synthetic DNA sequence from
Figure 5 is shown at the top along with the T3
polymerase promoter from Lambda Zap II. The
orientation of the insert in Lambda Zap II is
shown. The VH DNA homologs are inserted into the
Xho I and Spe I restriction enzyme sites. The VH
DNA homologs are inserted into the Xho I and Spe I sites and
the read-through transcription produces the
decapeptide epitope (tag) that is located just 3'
of the cloning sites.

Figure 8 The major features of the bacterial
expression vector Lambda Zap II VL (VL expression
vector) are shown. The synthetic sequence shown in
Figure 6 is shown at the top along with the T3
polymerase promoter from Lambda Zap II. The
orientation of the insert in Lambda Zap II is
shown. The VL DNA homologs are inserted into the
phagemid that is produced by the in vivo excision
protocol described by Short et al., Nucleic Acids
Res., 16:7583-7600, 1988. The VL DNA homologs are
inserted into the Nco I and Spe I cloning sites of
the phagemid.

Figure 9 A modified bacterial expression vector
Lambda Zap II VL II. This vector is constructed by
inserting this synthetic DNA sequence,

TGAATTCTAAACTAGTCGCCAAGGAGACAGTCATAATGAA 
TCGAACTTAAGATTTGATCAGCGGTTCCTCTGTCAGTATTACTT 

ATAGCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCTG 
TATGGATAACGGATGCCGTCGGCGACCTAACAATAATGAGCGAC 

CCCAACCAGCCATGGCCGAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 
GGGTTGGTCGGTACCGGCTCGAGCAGTCAAGATCTCAATTCGCCGGCAGCT 

into Lambda Zap II that has been digested with the
restriction enzymes Sac I and Xho I. This sequence
contains the Shine-Dalgarno sequence (ribosome
binding site), the leader sequence to direct the
expressed protein to the periplasm and the
appropriate nucleic acid sequence to allow the VL
DNA homologs to be operatively linked into the Sac I
and Xba I restriction enzyme sites provided by this
vector.

Figure 10 The sequence of the synthetic DNA
segment inserted into Lambda Zap II to produce the
lambda VL II expression vector. The various features
and restriction endonuclease recognition sites are
shown.

Figure 11 The vectors for expressing VH and VL
separately and in combination are shown. The
various essential components of these vectors are
shown. The light chain vector or VL expression
vector can be combined with the VH expression vector
to produce a combinatorial vector containing both VH
and VL operatively linked for expression to the same
promoter. This figure is shown in consecutively
numbered sheets.

Figure 12 The labelled proteins
immunoprecipitated from E. coli containing a VH and
a VL DNA homolog are shown. In lane 1, the
background proteins immunoprecipitated from E. coli
that do not contain a VH or VL DNA homolog are
shown. Lane 2 contains the VH protein
immunoprecipitated from E. coli containing only a VH
DNA homolog. In lanes 3 and 4, the co-migration of
a VH protein and a VL protein immunoprecipitated from E.
coli containing both a VH and a VL DNA homolog is
shown. In lane 5 the presence of VH protein and VL
protein expressed from the VH and VL DNA homologs is
demonstrated by the two distinguishable protein
species. Lane 5 contains the background proteins
immunoprecipitated by anti-E. coli antibodies
present in mouse ascites fluid.

Figure 13 The transition state analogue (formula
1) which induces antibodies for hydrolyzing
carboxamide substrate (formula 2). The compound of
formula 1 containing a glutaryl spacer and a
N-hydroxysuccinimide-linker appendage is the form
used to couple the hapten (formula 1) to protein
carriers KLH and BSA, while the compound of formula
3 is the inhibitor. The phosphonamidate
functionality is a mimic of the stereoelectronic
features of the transition state for hydrolysis of
the amide bond.

Figure 14 PCR amplification of Fd and
kappa regions from the spleen mRNA of a mouse
immunized with NPN is illustrated. Amplification
was performed as described in Example 17 using
RNA-cDNA hybrids obtained by the reverse transcription
of the mRNA with primer specific for amplification
of light chain sequences (Table 2) or heavy chain
sequences (Table 1). Lanes F1-F8 represent the
product of heavy chain amplification reactions with
one of each of the eight 5' primers (primers 2-9,
Table 1) and the unique 3' primer (primer 15, Table
2). Light chain (kappa) amplifications with the 5'
primers (primers 3-6, and 12, respectively, Table
2) and the appropriate 3' primer (primer 13, Table
2) are shown in lanes F9-F13. A band of 700 bps is
seen in all lanes indicating the successful
amplification of Fd and kappa regions.

Figure 15 The screening of phage
libraries for antigen binding is depicted according
to Example 17C. Duplicate plaque lifts of Fab
(filters A, B), heavy chain (filters E, F) and light
chain (filters G, H) expression libraries were
screened against 125I-labelled BSA conjugated with
NPN at a density of approximately 30,000 plaques
per plate. Filters C and D illustrate the
duplicate secondary screening of a cored positive
from a primary filter A (arrows) as discussed in
the text.

Screening employed standard plaque lift
methods. XL1 Blue cells infected with phage were
incubated on 150 mm plates for 4h at 37°, protein
expression induced by overlay with nitrocellulose
filters soaked in 10 mM isopropyl thiogalactoside
(IPTG) and the plates incubated at 25° for 8h.
Duplicate filters were obtained during a second
incubation employing the same conditions. Filters
were then blocked in a solution of 1% BSA in PBS
for 1h before incubation with rocking at 25° for 1h
with a solution of 125I-labelled BSA conjugated to
NPN (2 x 10^6 cpm ml^-1; BSA concentration at 0.1 M;
approximately 15 NPN per BSA molecule) in 1%
BSA/PBS. Background was reduced by pre-centrifugation
of stock radiolabelled BSA solution
at 100,000 g for 15 min and pre-incubation of
solutions with plaque lifts from plates containing
bacteria infected with a phage having no insert.
After labeling, filters were washed repeatedly with
PBS/0.05% Tween 20 before development of
autoradiographs overnight.

Figure 16 The specificity of antigen
binding as shown by competitive inhibition is
illustrated according to Example 17C. Filter lifts
from positive plaques were exposed to 125I-BSA-NPN
in the presence of increasing concentrations of the
inhibitor NPN.

In this study a number of phages
correlated with NPN binding as in Figure 15 were
spotted (about 100 particles per spot) directly
onto a bacterial lawn. The plate was then
overlaid with an IPTG-soaked filter and incubated
for 19h at 25°. The filters were then blocked in 1%
BSA in PBS prior to incubation in 125I-BSA-NPN as
described previously in Figure 15 except with the
inclusion of varying amounts of NPN in the labeling
solution. Other conditions and procedures were as
in Figure 15. The results for a phage of moderate
affinity are shown in duplicate in the figure.
Similar results were obtained for four other phages
with some differences in the effective inhibitor
concentration ranges.

Figure 17 The characterization of an
antigen binding protein is illustrated according to
Example 17B. The concentrated partially purified
bacterial supernate of an NPN-binding clone was
separated by gel filtration and aliquots from each
fraction applied to microtitre plates coated with
BSA-NPN. Addition of either anti-decapeptide ( )
or anti-kappa chain ( ) antibodies conjugated
with alkaline phosphatase was followed by color
development. The arrow indicates the position of
elution of a known Fab fragment. The results show
that antigen binding is a property of a 50 kD protein
containing both heavy and light chains.

Single plaques of two NPN-positive clones
(Figure 15) were picked and the plasmid containing
the heavy and light chain inserts excised. 500 ml
cultures in L-broth were inoculated with 3 ml of a
saturated culture containing the excised plasmids
and incubated for 4h at 37°. Protein synthesis was
induced by the addition of IPTG to a final
concentration of 1 mM and the cultures incubated for
10h at 25°. 200 ml of cell supernate were
concentrated to 2 ml and applied to a TSK-G4000
column. 50 ul aliquots from the eluted fractions
were assayed by ELISA.

For ELISA analysis, microtitre plates
were coated with BSA-NPN at 1 ug/ml, 50 ul samples
mixed with 50 ul PBS-Tween 20 (0.05%)-BSA (0.1%)
added and the plates incubated for 2h at 25°.
After washing with PBS-Tween 20-BSA, 50 ul of
appropriate concentrations of a rabbit anti-decapeptide
antibody (20) and a goat anti-mouse
kappa light chain (Southern Biotech) antibody
conjugated with alkaline phosphatase were added and
incubated for 2h at 25°. After further washing, 50
ul of p-nitrophenyl phosphate (1 mg/ml in 0.1M tris
pH 9.5 containing 50 mM MgCl2) were added and the
plates incubated for 15-30 min before reading the
OD at 405 nm.

Detailed Description of the Invention 
A. DEFINITIONS 
Nucleotide: a monomeric unit of DNA
or RNA consisting of a sugar moiety (pentose), a
phosphate, and a nitrogenous heterocyclic base.
The base is linked to the sugar moiety via the
glycosidic carbon (1' carbon of the pentose) and
that combination of base and sugar is a nucleoside.
When the nucleoside contains a phosphate group
bonded to the 3' or 5' position of the pentose it
is referred to as a nucleotide.

Base Pair (bp): a partnership of
adenine (A) with thymine (T), or of cytosine (C)
with guanine (G) in a double stranded DNA molecule.
In RNA, uracil (U) is substituted for thymine.

Nucleic Acid: a polymer of
nucleotides, either single or double stranded.

Gene: a nucleic acid whose
nucleotide sequence codes for a RNA or polypeptide.
A gene can be either RNA or DNA.

Complementary Bases: nucleotides
that normally pair up when DNA or RNA adopts a
double stranded configuration.

Complementary Nucleotide Sequence:
a sequence of nucleotides in a single-stranded
molecule of DNA or RNA that is sufficiently
complementary to that on another single strand to
specifically hybridize to it with consequent
hydrogen bonding.

Conserved: a nucleotide sequence is
conserved with respect to a preselected (reference)
sequence if it non-randomly hybridizes to an exact
complement of the preselected sequence.

Hybridization: the pairing of
substantially complementary nucleotide sequences
(strands of nucleic acid) to form a duplex or
heteroduplex by the establishment of hydrogen bonds
between complementary base pairs. It is a
specific, i.e. non-random, interaction between two
complementary polynucleotides that can be
competitively inhibited.

Nucleotide Analog: a purine or
pyrimidine nucleotide that differs structurally
from A, T, G, C, or U, but is sufficiently similar
to substitute for the normal nucleotide in a
nucleic acid molecule.

DNA Homolog: a nucleic acid
having a preselected conserved nucleotide sequence
and a sequence coding for a receptor capable of
binding a preselected ligand.

Antibody: The term antibody in its
various grammatical forms is used herein to refer
to immunoglobulin molecules and immunologically
active portions of immunoglobulin molecules, i.e.,
molecules that contain an antibody combining site
or paratope. Exemplary antibody molecules are
intact immunoglobulin molecules, substantially
intact immunoglobulin molecules and portions of an
immunoglobulin molecule, including those portions
known in the art as Fab, Fab', F(ab')2 and F(v).

Antibody Combining Site: An
antibody combining site is that structural portion
of an antibody molecule comprised of heavy and
light chain variable and hypervariable regions that
specifically binds (immunoreacts with) an antigen.
The term immunoreact in its various forms means
specific binding between an antigenic determinant-containing
molecule and a molecule containing an
antibody combining site such as a whole antibody
molecule or a portion thereof.

Monoclonal Antibody: The phrase
monoclonal antibody in its various grammatical
forms refers to a population of antibody molecules
that contains only one species of antibody
combining site capable of immunoreacting with a
particular antigen. A monoclonal antibody thus
typically displays a single binding affinity for
any antigen with which it immunoreacts. A
monoclonal antibody may therefore contain an
antibody molecule having a plurality of antibody
combining sites, each immunospecific for a
different antigen, e.g., a bispecific monoclonal
antibody.
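
The definitions of Base Pair and Complementary Nucleotide Sequence given above can be made concrete with a short sketch. It is offered only as an illustration under simplifying assumptions: the function names are hypothetical, only the four standard DNA bases are handled, and "complementary" is reduced to exact Watson-Crick pairing, whereas the definition above requires only that the strands be sufficiently complementary to hybridize.

```python
# Watson-Crick partners for DNA, as stated in the Base Pair definition.
DNA_PAIRS = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq):
    """Return the strand that pairs exactly with `seq`, written 5' to 3'."""
    return "".join(DNA_PAIRS[base] for base in reversed(seq.upper()))


def to_rna(seq):
    """Write a DNA sequence as RNA: uracil (U) takes the place of thymine (T)."""
    return seq.upper().replace("T", "U")


def is_exact_complement(strand_a, strand_b):
    """True if the two strands (both written 5' to 3') pair at every position."""
    return reverse_complement(strand_a) == strand_b.upper()


print(reverse_complement("ATGC"))
print(to_rna("ATGC"))
print(is_exact_complement("ATGC", "GCAT"))
```

Partial complementarity, which is what the Conserved and Hybridization definitions rely on, is deliberately not modelled here; a mismatch-tolerant comparison would be needed for that.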

B. METHODS 

The present invention contemplates a 
method of isolating from a repertoire of conserved 
genes a gene coding for a receptor having a 
preselected activity, preferably a catalytic 
activity. The receptor can be a polypeptide, an 
RNA molecule, such as a transfer RNA, an RNA 
displaying enzymatic activity, and the like. 
Preferably, the receptor will be a polypeptide 
capable of binding a ligand, such as an enzyme, 
antibody molecule or immunologically active portion 
thereof, cellular receptor, or cellular adhesion 
protein coded for by one of the members of a family 
of conserved genes, i.e., genes containing a 
conserved nucleotide sequence of at least about 10
nucleotides in length. 
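
As a rough illustration of what a shared sequence of about 10 nucleotides looks like across a gene family, the sketch below collects every 10-nucleotide stretch present in all members of a toy set of sequences. It rests on assumptions that go beyond the text: the sequences and the helper name `shared_kmers` are hypothetical, and exact identity is used as a simple stand-in for the hybridization-based definition of "conserved" given in the Definitions.

```python
def shared_kmers(sequences, k=10):
    """Return the set of length-k stretches found in every input sequence."""

    def kmers(seq):
        seq = seq.upper()
        return {seq[i:i + k] for i in range(len(seq) - k + 1)}

    common = kmers(sequences[0])
    for seq in sequences[1:]:
        common &= kmers(seq)  # keep only stretches seen in this sequence too
    return common


# Hypothetical family members: one 10-nucleotide core with differing flanks.
core = "AGGTCCAGCT"
family = [
    "ACGTA" + core + "GGATC",
    "TTGCC" + core + "CATGA",
    "GGGAT" + core + "TTACG",
]

print(shared_kmers(family, k=10))
```

In practice a shared stretch of this kind is what makes it possible to design a single probe or primer that recognizes many members of the repertoire at once.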

Exemplary conserved gene families are 
those coding for immunoglobulins, major 
histocompatibility complex antigens of class I or 
II, lymphocyte receptors, integrins and the like. 

A gene can be identified as belonging to 
a repertoire of conserved genes using several 
methods. For example, an isolated gene may be used 
as a hybridization probe under low stringency 
conditions to detect other members of the 
repertoire of conserved genes present in genomic 
DNA using the methods described by Southern, J.
Mol. Biol., 98:503 (1975). If the gene used as a
hybridization probe hybridizes to multiple 
restriction endonuclease fragments that gene is a 
member of a repertoire of conserved genes. 

Immunoglobulins 

The immunoglobulins, or antibody
molecules, are a large family of molecules that
include several types of molecules, such as IgD,
IgG, IgA, IgM and IgE. The antibody molecule is
typically comprised of two heavy (H) and two light (L)
chains with both a variable (V) and constant (C)
region present on each chain. Several different
regions of an immunoglobulin contain conserved
sequences useful for isolating an immunoglobulin
repertoire. Extensive amino acid and nucleic acid
sequence data displaying exemplary conserved
sequences is compiled for immunoglobulin molecules
by Kabat et al., in Sequences of Proteins of
Immunological Interest, National Institutes of
Health, Bethesda, MD, 1987.

The C region of the H chain defines the
particular immunoglobulin type. Therefore the
selection of conserved sequences as defined herein
from the C region of the H chain results in the
preparation of a repertoire of immunoglobulin genes
having members of the immunoglobulin type of the
selected C region.

The V region of the H or L chain
typically comprises four framework (FR) regions
each containing relatively lower degrees of
variability that include lengths of conserved
sequences. The use of conserved sequences from the
FR1 and FR4 (J region) framework regions of the VH
chain is a preferred exemplary embodiment and is
described herein in the Examples. Framework
regions are typically conserved across several or
all immunoglobulin types and thus conserved
sequences contained therein are particularly suited
for preparing repertoires having several
immunoglobulin types.

Major Histocompatibility Complex

The major histocompatibility complex
(MHC) is a large genetic locus that encodes an
extensive family of proteins that include several
classes of molecules referred to as class I, class
II or class III MHC molecules. Paul et al., in
Fundamental Immunology, Raven Press, NY, pp. 303-378
(1984).

Class I MHC molecules are a polymorphic
group of transplantation antigens representing a
conserved family in which the antigen is comprised
of a heavy chain and a non-MHC encoded light chain.
The heavy chain includes several regions, termed
the N, C1, C2, membrane and cytoplasmic regions.
Conserved sequences useful in the present invention
are found primarily in the N, C1 and C2 regions and
are identified as continuous sequences of
"invariant residues" in Kabat et al., supra.

Class II MHC molecules comprise a
conserved family of polymorphic antigens that
participate in immune responsiveness and are
comprised of an alpha and a beta chain. The genes
coding for the alpha and beta chain each include
several regions that contain conserved sequences
suitable for producing MHC class II alpha or beta
chain repertoires. Exemplary conserved nucleotide
sequences include those coding for amino acid
residues 25-30 of the A1 region, residues 161-170
of the A2 region and residues 195-206 of the
membrane region, all of the alpha chain. Conserved
sequences are also present in the B1, B2 and
membrane regions of the beta chain at nucleotide
sequences coding for amino acid residues 41-45,
150-162 and 200-209, respectively.

Lymphocyte Receptors and Cell Surface Antigens

Lymphocytes contain several families of
proteins on their cell surfaces including the
T-cell receptor, Thy-1 antigen and numerous T-cell
surface antigens including the antigens defined by
the monoclonal antibodies OKT4 (leu3), OKT5/8
(leu2), OKT3, OKT1 (leu1), OKT11 (leu5), OKT6 and
OKT9. Paul, supra at pp. 458-479.

The T-cell receptor is a term used for a
family of antigen binding molecules found on the
surface of T-cells. The T-cell receptor as a
family exhibits polymorphic binding specificity
similar to immunoglobulins in its diversity. The
mature T-cell receptor is comprised of alpha and
beta chains each having a variable (V) and constant
(C) region. The similarities that the T-cell
receptor has to immunoglobulins in genetic
organization and function show that the T-cell
receptor contains regions of conserved sequence.
Lai et al., Nature, 331:543-546 (1988).

Exemplary conserved sequences include 
those coding for amino acid residues 84-90 of alpha 
chain, amino acid residues 107-115 of beta chain, 
and amino acid residues 91-95 and 111-116 of the 
gamma chain. Kabat et al., supra, p. 279. 

Integrins And Adhesins 

Adhesive proteins involved in cell
attachment are members of a large family of related
proteins termed integrins. Integrins are
heterodimers comprised of a beta and an alpha
subunit. Members of the integrin family include
the cell surface glycoproteins platelet receptor
GpIIb-IIIa, vitronectin receptor (VnR), fibronectin
receptor (FnR) and the leukocyte adhesion receptors
LFA-1, Mac-1, Mo-1 and 60.3. Ruoslahti et al.,
Science, 238:491-497 (1987). Nucleic acid and
protein sequence data demonstrate that regions of
conserved sequences exist in the members of these
families, particularly between the beta chain of
GpIIb-IIIa, VnR and FnR, and between the alpha
subunit of VnR, Mac-1, LFA-1, FnR and GpIIb-IIIa.
Suzuki et al., Proc. Natl. Acad. Sci. USA,
83:8614-8618, 1986; Ginsberg et al., J. Biol. Chem.,
262:5437-5440, 1987.

The following discussion illustrates the 
method of the present invention applied to 
isolating a conserved receptor-coding gene from the 
immunoglobulin gene repertoire. This discussion is
not to be taken as limiting, but rather as
illustrating application of principles that can be 
used to isolate a gene from any family of conserved 
genes coding for functionally related receptors. 

Generally, the method combines the 
following elements: 

1. Isolating nucleic acids 
containing a substantial portion of the 
immunological repertoire. 

2. Preparing polynucleotide 
primers for cloning polynucleotide segments 
containing immunoglobulin V H and V L region genes. 

3. Preparing a gene library 
containing a plurality of different V H and V L genes 
from the repertoire. 

4. Expressing the V H and V L 
polypeptides in a suitable host, including 
prokaryotic and eukaryotic hosts, either separately 
or in the same cell, and either on the same or 
different expression vectors. 

5. Screening the expressed 
polypeptides for the preselected activity, and 
segregating a V H- and V L-coding gene combination
identified by the screening process. 

An antibody produced by the present
invention assumes a conformation having a binding
site specific for, as evidenced by its ability to be
competitively inhibited, a preselected or
predetermined ligand such as an antigen, enzymatic
substrate and the like. In one embodiment, an
antibody of this invention forms an antigen binding
site which specifically binds to a preselected
antigen to form an immunoreaction product (complex)
having a sufficiently strong binding between the
antigen and the binding site for the immunoreaction
product to be isolated. The antibody typically has
an affinity or avidity that is generally greater
than 10^5 M^-1, more usually greater than 10^6 M^-1
and preferably greater than 10^8 M^-1.

In another embodiment, an antibody of the
subject invention binds a substrate and catalyzes
the formation of a product from the substrate.
While the topology of the ligand binding site of a
catalytic antibody is probably more important for
its preselected activity than its affinity
(association constant or pKa) for the substrate,
the subject catalytic antibodies have an
association constant for the preselected substrate
generally greater than 10^3 M^-1, more usually
greater than 10^5 M^-1 or 10^6 M^-1 and preferably
greater than 10^7 M^-1.
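
By way of illustration only, the association-constant
thresholds recited above can be related to dissociation
constants through the standard relation Kd = 1/Ka. The
following Python sketch is not part of the described
method; the function names and the tier labels
("general", "more usual", "preferred") are assumptions
introduced here for readability, and the catalytic
"more usual" tier is taken at the lower of the two
values given (10^5 M^-1).

```python
# Illustrative sketch only; names and tier labels are hypothetical.

# Association-constant thresholds (M^-1) quoted in the text above,
# ordered from the most to the least stringent.
BINDING_TIERS = [(1e8, "preferred"), (1e6, "more usual"), (1e5, "general")]
CATALYTIC_TIERS = [(1e7, "preferred"), (1e5, "more usual"), (1e3, "general")]


def dissociation_constant(ka_per_molar: float) -> float:
    """Return Kd in mol/L for an association constant Ka in M^-1 (Kd = 1/Ka)."""
    if ka_per_molar <= 0:
        raise ValueError("association constant must be positive")
    return 1.0 / ka_per_molar


def affinity_tier(ka_per_molar: float, tiers=BINDING_TIERS) -> str:
    """Return the label of the highest threshold that Ka strictly exceeds."""
    for threshold, label in tiers:
        if ka_per_molar > threshold:
            return label
    return "below stated range"
```

For example, an antibody with Ka = 5 x 10^8 M^-1 falls in
the "preferred" binding tier and corresponds to a Kd of
2 x 10^-9 mol/L.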

Preferably the antibody produced by the
subject invention is heterodimeric and is therefore
normally comprised of two different polypeptide
chains, which together assume a conformation having
a binding affinity, or association constant, for the
preselected antigen that is different, preferably
higher, than the affinity or association constant
of either of the polypeptides alone, i.e., as
monomers. One or both of the different polypeptide
chains is derived from the variable region of the
light and heavy chains of an immunoglobulin.
Typically, polypeptides comprising the light (V L)
and heavy (V H) variable regions are employed
together for binding the preselected antigen.

A V H or V L produced by the subject
invention can be active in monomeric as well as
multimeric forms, either homomeric or heteromeric,
preferably heterodimeric. A V H and V L ligand
binding polypeptide produced by the present
invention can be advantageously combined in a
heterodimer (antibody molecule) to modulate the
activity of either or to produce an activity unique
to the heterodimer. The individual ligand binding
polypeptides will be referred to as V H and V L and
the heterodimer will be referred to as an antibody
molecule.

However, it should be understood that a V H
binding polypeptide may contain, in addition to the
V H, substantially all or a portion of the heavy
chain constant region. A V L binding polypeptide may
contain, in addition to the V L, substantially all or
a portion of the light chain constant region. A
heterodimer comprised of a V H binding polypeptide
containing a portion of the heavy chain constant
region and a V L binding polypeptide containing
substantially all of the light chain constant
region is termed a Fab fragment. The production of
Fab can be advantageous in some situations because
the additional constant region sequences contained
in a Fab as compared to a F v could stabilize the
V H and V L interaction. Such stabilization could
cause the Fab to have higher affinity for antigen.
In addition, the Fab is more commonly used in the
art and thus there are more commercial antibodies
available to specifically recognize a Fab.

The individual V H and V L polypeptides will
generally have fewer than 125 amino acid residues,
more usually fewer than about 120 amino acid
residues, while normally having greater than 60
amino acid residues, usually greater than about 95
amino acid residues, more usually greater than
about 100 amino acid residues. Preferably, the V H
will be from about 110 to about 125 amino acid
residues in length while V L will be from about 95 to
about 115 amino acid residues in length.

The amino acid residue sequences will 
vary widely, depending upon the particular idiotype 
involved. Usually, there will be at least two 
cysteines separated by from about 60 to 75 amino 
acid residues and joined by a disulfide bond. The 
polypeptides produced by the subject invention will 
normally be substantial copies of idiotypes of the 
variable regions of the heavy and/or light chains 
of immunoglobulins, but in some situations a 
polypeptide may contain random mutations in amino 
acid residue sequences in order to advantageously 
improve the desired activity. 
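
The length ranges and cysteine spacing described above
lend themselves to a simple sequence check. The Python
sketch below is offered only as an illustration: the
function names are hypothetical, the example sequence is
a placeholder rather than a real variable region, and
"separated by" is read here as the number of residues
lying between the two cysteines, which is an assumption.

```python
# Illustrative sketch only; names and the example sequence are hypothetical.

# Preferred length ranges (amino acid residues) stated in the text.
PREFERRED_LENGTH = {"VH": (110, 125), "VL": (95, 115)}


def in_preferred_length(sequence: str, chain: str) -> bool:
    """True if the sequence length lies within the preferred range for the chain."""
    low, high = PREFERRED_LENGTH[chain]
    return low <= len(sequence) <= high


def cysteine_pairs(sequence: str, min_gap: int = 60, max_gap: int = 75):
    """Return index pairs of cysteines with min_gap..max_gap residues between them.

    The gap is counted as the number of intervening residues; this reading
    of "separated by" is an assumption.
    """
    positions = [i for i, residue in enumerate(sequence) if residue == "C"]
    return [
        (a, b)
        for n, a in enumerate(positions)
        for b in positions[n + 1:]
        if min_gap <= b - a - 1 <= max_gap
    ]


# Placeholder sequence (not a real antibody): two cysteines with
# 70 residues between them, 113 residues in total.
example = "A" * 21 + "C" + "A" * 70 + "C" + "A" * 20
```

A sequence passing both checks is merely consistent with
the stated ranges; the checks say nothing about binding
activity.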

In some situations, it is desirable to 
provide for covalent cross linking of the V H and V L 
polypeptides, which can be accomplished by 
providing cysteine residues at the carboxyl termini.
The polypeptide will normally be prepared free of
the immunoglobulin constant regions; however, a
small portion of the J region may be included as a 
result of the advantageous selection of DNA 
synthesis primers. The D region will normally be 
included in the transcript of the V H . 

In other situations, it is desirable to
provide a peptide linker to connect the V L and the
V H to form a single-chain antigen-binding protein
comprised of a V H and a V L. This single-chain
antigen-binding protein would be synthesized as a
single protein chain. Such single-chain
antigen-binding proteins have been described by
Bird et al., Science, 242:423-426 (1988). The
design of suitable peptide linker regions is
described in U.S. Patent No. 4,704,692 by Robert
Ladner.

Such a peptide linker could be designed
as part of the nucleic acid sequences contained in
the expression vector. The nucleic acid sequences
coding for the peptide linker would be between the
V H and V L DNA homologs and the restriction
endonuclease sites used to operatively link the V H
and V L DNA homologs to the expression vector.

Such a peptide linker could also be coded
for by nucleic acid sequences that are part of the
polynucleotide primers used to prepare the various 
gene libraries. The nucleic acid sequence coding 
for the peptide linker can be made up of nucleic 
acids attached to one of the primers or the nucleic 
acid sequence coding for the peptide linker may be 
derived from nucleic acid sequences that are 
attached to several polynucleotide primers used to 
create the gene libraries. 
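
Whichever route supplies the linker-coding sequence, the
V H, linker and V L coding segments must remain in one
reading frame. The Python sketch below illustrates that
bookkeeping under stated assumptions: the segment
sequences are hypothetical toy placeholders, the
Gly-Gly-Gly-Gly-Ser linker is chosen only as an example
and is not taken from the references cited above, and
the function names are introduced here.

```python
# Illustrative sketch only; segment sequences are hypothetical placeholders.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def split_codons(dna: str) -> list:
    """Split a coding sequence into consecutive three-base codons."""
    return [dna[i:i + 3] for i in range(0, len(dna), 3)]


def join_in_frame(vh_dna: str, linker_dna: str, vl_dna: str) -> str:
    """Concatenate V H, linker and V L coding segments as one reading frame.

    Raises ValueError if any segment would shift the frame or if a stop
    codon appears before the final codon of the fused sequence.
    """
    for name, segment in (("V H", vh_dna), ("linker", linker_dna), ("V L", vl_dna)):
        if len(segment) % 3 != 0:
            raise ValueError(f"{name} segment length is not a multiple of 3")
    fused = vh_dna + linker_dna + vl_dna
    if any(codon in STOP_CODONS for codon in split_codons(fused)[:-1]):
        raise ValueError("premature stop codon in fused sequence")
    return fused


# Hypothetical toy segments; the linker encodes Gly-Gly-Gly-Gly-Ser.
fused = join_in_frame("ATGGCC", "GGTGGCGGTGGCTCT", "GACATCTAA")
```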

Typically the C terminus region of the V H
and V L polypeptides will have a greater variety of
sequences than the N terminus and, based on the
present strategy, can be further modified to permit
a variation of the normally occurring V H and V L
chains. A synthetic polynucleotide can be employed
to vary one or more amino acids in a hypervariable
region.

1. Isolation Of The Repertoire 
To prepare a composition of nucleic acids 
containing a substantial portion of the 
immunological gene repertoire, a source of genes 
coding for the V H and/or V L polypeptides is 
required. Preferably the source will be a 
heterogeneous population of antibody producing 
cells, i.e. B lymphocytes (B cells), preferably 
rearranged B cells such as those found in the 
circulation or spleen of a vertebrate. (Rearranged 
B cells are those in which immunoglobulin gene 
translocation, i.e., rearrangement, has occurred as
evidenced by the presence in the cell of mRNA with
the immunoglobulin gene V, D and J region 
transcripts adjacently located thereon.) 

In some cases, it is desirable to bias
the repertoire for a preselected activity, such as
by using as a source of nucleic acid cells (source
cells) from vertebrates in any one of various
stages of age, health and immune response. For
example, repeated immunization of a healthy animal
prior to collecting rearranged B cells results in
obtaining a repertoire enriched for genetic
material producing a ligand binding polypeptide of
high affinity. Conversely, collecting rearranged B
cells from a healthy animal whose immune system has
not been recently challenged results in producing a
repertoire that is not biased towards the
production of high affinity V H and/or V L
polypeptides.

It should be noted that the greater the
genetic heterogeneity of the population of cells
from which the nucleic acids are obtained, the
greater the diversity of the immunological 
repertoire that will be made available for 
screening according to the method of the present 
invention. Thus, cells from different individuals, 
particularly those having an immunologically 
significant age difference, and cells from 
individuals of different strains, races or species 
can be advantageously combined to increase the 
heterogeneity of the repertoire. 

Thus, in one preferred embodiment, the 
source cells are obtained from a vertebrate, 
preferably a mammal, which has been immunized or 
partially immunized with an antigenic ligand 
(antigen) against which activity is sought, i.e., a
preselected antigen. The immunization can be
carried out conventionally. Antibody titer in the
animal can be monitored to determine the stage of 
immunization desired, which stage corresponds to 
the amount of enrichment or biasing of the 
repertoire desired. Partially immunized animals 
typically receive only one immunization and cells 
are collected therefrom shortly after a response is 
detected. Fully immunized animals display a peak 
titer, which is achieved with one or more repeated 
injections of the antigen into the host mammal, 
normally at 2 to 3 week intervals. Usually three 
to five days after the last challenge, the spleen 
is removed and the genetic repertoire of the 
splenocytes, about 90% of which are rearranged B
cells, is isolated using standard procedures. See,
Current Protocols in Molecular Biology, Ausubel et
al., eds., John Wiley & Sons, NY. Nucleic acids
coding for V H and V L polypeptides can be derived
from cells producing IgA, IgD, IgE, IgG or IgM,
most preferably from IgM and IgG producing cells.

Methods for preparing fragments of
genomic DNA from which immunoglobulin variable
region genes can be cloned as a diverse population
are well known in the art. See for example
Herrmann et al., Methods In Enzymol., 152:180-183
(1987); Frischauf, Methods In Enzymol., 152:183-190
(1987); Frischauf, Methods In Enzymol., 152:190-199
(1987); and DiLella et al., Methods In Enzymol.,
152:199-212 (1987). (The teachings of the
references cited herein are hereby incorporated by
reference.)

The desired gene repertoire can be
isolated from either genomic material containing
the gene expressing the variable region or the
messenger RNA (mRNA) which represents a transcript
of the variable region. The difficulty in using
the genomic DNA from other than rearranged B
lymphocytes is in juxtaposing the sequences coding
for the variable region, where the sequences are
separated by introns. The DNA fragment(s)
containing the proper exons must be isolated, the
introns excised, and the exons then spliced in the
proper order and in the proper orientation. For
the most part, this will be difficult, so that the
alternative technique employing rearranged B cells
will be the method of choice because the V, D and J
immunoglobulin gene regions have translocated to
become adjacent, so that the sequence is continuous
(free of introns) for the entire variable regions.

Where mRNA is utilized the cells will be
lysed under RNase inhibiting conditions. In one
embodiment, the first step is to isolate the total
cellular mRNA by hybridization to an oligo-dT
cellulose column. The presence of mRNAs coding for
the heavy and/or light chain polypeptides can then
be assayed by hybridization with DNA single strands
of the appropriate genes. Conveniently, the
sequences coding for the constant portion of the V H
and V L can be used as polynucleotide probes, which
sequences can be obtained from available sources.
See for example, Early and Hood, Genetic
Engineering, Setlow and Hollaender, eds., Vol. 3,
Plenum Publishing Corporation, New York, (1981),
pages 157-188; and Kabat et al., Sequences of
Immunological Interest, National Institutes of
Health, Bethesda, MD, (1987). In preferred
embodiments, the preparation containing the total
cellular mRNA is first enriched for the presence of
V H and/or V L coding mRNA. Enrichment is typically
accomplished by subjecting the total mRNA
preparation or partially purified mRNA product
thereof to a primer extension reaction employing a
polynucleotide synthesis primer of the present
invention.

2. Preparation Of Polynucleotide Primers

The term "polynucleotide" as used herein 
in reference to primers, probes and nucleic acid 
fragments or segments to be synthesized by primer 
extension is defined as a molecule comprised of two 
or more deoxyribonucleotides or ribonucleotides,
preferably more than 3. Its exact size will depend
on many factors, which in turn depend on the
ultimate conditions of use. 

The term "primer" as used herein refers 
to a polynucleotide whether purified from a nucleic 
acid restriction digest or produced synthetically, 
which is capable of acting as a point of initiation 
of synthesis when placed under conditions in which 
synthesis of a primer extension product which is 
complementary to a nucleic acid strand is induced, 
i.e., in the presence of nucleotides and an agent 
for polymerization such as DNA polymerase, reverse 
transcriptase and the like, and at a suitable 
temperature and pH. The primer is preferably 
single stranded for maximum efficiency, but may 
alternatively be double stranded. If double 
stranded, the primer is first treated to separate 
its strands before being used to prepare extension 
products. Preferably, the primer is a 
polydeoxyribonucleotide. The primer must be
sufficiently long to prime the synthesis of
extension products in the presence of the agents
for polymerization. The exact lengths of the
primers will depend on many factors, including
temperature and the source of primer. For example,
depending on the complexity of the target sequence, 
a polynucleotide primer typically contains 15 to 25 
or more nucleotides, although it can contain fewer 
nucleotides. Short primer molecules generally 
require cooler temperatures to form sufficiently 
stable hybrid complexes with template. 
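
The dependence of hybrid stability on primer length can
be illustrated with the Wallace rule, a common rule of
thumb for short oligonucleotides that estimates the
melting temperature as 2 degrees C per A or T plus
4 degrees C per G or C. The rule is not taken from this
specification; it is used here only as an assumed,
approximate model, and the function name is
hypothetical.

```python
# Illustrative sketch only; the Wallace rule is a rough approximation
# for short primers and is an assumption, not part of the source text.


def wallace_tm(primer: str) -> int:
    """Estimate melting temperature (deg C) as 2*(A+T) + 4*(G+C)."""
    primer = primer.upper()
    if set(primer) - set("ACGT"):
        raise ValueError("primer must contain only A, C, G and T")
    at_count = primer.count("A") + primer.count("T")
    gc_count = primer.count("G") + primer.count("C")
    return 2 * at_count + 4 * gc_count
```

Under this approximation a 12-nucleotide primer of
balanced composition melts well below a 20-nucleotide
primer of the same composition, consistent with the
cooler temperatures required for shorter primers.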

The primers used herein are selected to 
be "substantially" complementary to the different 
strands of each specific sequence to be synthesized 
or amplified. This means that the primer must be 
sufficiently complementary to non-randomly 
hybridize with its respective template strand. 
Therefore, the primer sequence may not reflect the 
exact sequence of the template. For example, a 
non-complementary nucleotide fragment can be 
attached to the 5' end of the primer, with the 
remainder of the primer sequence being 
substantially complementary to the strand. Such 
noncomplementary fragments typically code for an 
endonuclease restriction site. Alternatively, 
noncomplementary bases or longer sequences can be 
interspersed into the primer, provided the primer 
sequence has sufficient complementarity with the
sequence of the strand to be synthesized or 
amplified to non-randomly hybridize therewith and 
thereby form an extension product under 
polynucleotide synthesizing conditions. 
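
A primer of the kind just described, carrying a
non-complementary 5' fragment that codes for a
restriction site, can be sketched in a few lines of
Python. The sketch is illustrative only: the template
sequence is a hypothetical placeholder, the EcoRI
recognition sequence GAATTC is used merely as a familiar
example of a restriction site, and the function names
are introduced here.

```python
# Illustrative sketch only; sequences and names are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


def add_five_prime_tail(annealing_region: str, tail: str) -> str:
    """Attach a non-complementary tail (e.g. a restriction site) to the 5' end."""
    return tail + annealing_region


def mismatches_at(primer_region: str, template: str, start: int) -> int:
    """Count mismatches when primer_region anneals to the template at start.

    Both sequences are written 5'->3'. The primer anneals antiparallel, so
    it is compared with the reverse complement of the template window.
    """
    window = template[start:start + len(primer_region)]
    if len(window) != len(primer_region):
        raise ValueError("primer extends past the end of the template")
    target = reverse_complement(window)
    return sum(1 for p, t in zip(primer_region, target) if p != t)


# Hypothetical template strand and a 12-nucleotide annealing region.
template = "TTGACCGTAGGCTAACGT"
annealing = reverse_complement(template[3:15])
primer = add_five_prime_tail(annealing, "GAATTC")
```

Only the annealing region is scored against the
template; the 5' tail is expected to be
non-complementary and is carried into the extension
product.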

The polynucleotide primers can be
prepared using any suitable method, such as, for
example, the phosphotriester or phosphodiester
methods; see Narang et al., Meth. Enzymol., 68:90
(1979); U.S. Patent No. 4,356,270; and Brown et
al., Meth. Enzymol., 68:109 (1979).

The choice of a primer's nucleotide
sequence depends on factors such as the distance on
the nucleic acid from the region coding for the
desired receptor, its hybridization site on the
nucleic acid relative to any second primer to be
used, the number of genes in the repertoire it is
to hybridize to, and the like.

For example, to produce V H-coding DNA
homologs by primer extension, the nucleotide
sequence of a primer is selected to hybridize with
a plurality of immunoglobulin heavy chain genes at
a site substantially adjacent to the V H-coding
region so that a nucleotide sequence coding for a
functional (capable of binding) polypeptide is
obtained. To hybridize to a plurality of different
V H-coding nucleic acid strands, the primer must be a
substantial complement of a nucleotide sequence
conserved among the different strands. Such sites
include nucleotide sequences in the constant
region, any of the variable region framework
regions, preferably the third framework region,
leader region, promoter region, J region and the
like.

Primers of the present invention may also
contain a DNA-dependent RNA polymerase promoter
sequence or its complement. See for example, Krieg
et al., Nucleic Acids Research, 12:7057-70 (1984);
Studier et al., 189:113-130 (1986); and Molecular
Cloning: A Laboratory Manual, Second Edition,
Maniatis et al., eds., Cold Spring Harbor, NY
(1989).

When a primer containing a DNA-dependent
RNA polymerase promoter is used, the primer is
hybridized to the polynucleotide strand to be
amplified and the second polynucleotide strand of
the DNA-dependent RNA polymerase promoter is
completed using an inducing agent such as E. coli
DNA polymerase I, or the Klenow fragment of E. coli
DNA polymerase I. The complementary RNA
polynucleotide is then produced by adding an
RNA-dependent RNA polymerase. The starting
polynucleotide of an RNA polynucleotide is
amplified by alternating between the production of
an RNA polynucleotide and DNA polynucleotide.

If the V H-coding and V L-coding DNA
homologs are to be produced by polymerase chain
reaction (PCR) amplification, two primers must be
used for each coding strand of nucleic acid to be
amplified. The first primer becomes part of the
nonsense (minus or complementary) strand and
hybridizes to a nucleotide sequence conserved among
V H (plus) strands within the repertoire. To produce
V H coding DNA homologs, first primers are therefore
chosen to hybridize to (i.e. be complementary to)
conserved regions within the J region, CH1 region,
hinge region, CH2 region, or CH3 region of
immunoglobulin genes and the like. To produce a V L
coding DNA homolog, first primers are chosen to
hybridize with (i.e. be complementary to) a
conserved region within the J region or constant
region of immunoglobulin light chain genes and the
like. Second primers become part of the coding
(plus) strand and hybridize to a nucleotide
sequence conserved among minus strands. To produce
the V H-coding DNA homologs, second primers are
therefore chosen to hybridize with a conserved
nucleotide sequence at the 5' end of the V H-coding
immunoglobulin gene such as in that area coding for
the leader or first framework region. It should be
noted that in the amplification of both V H- and
V L-coding DNA homologs the conserved 5' nucleotide
sequence of the second primer can be complementary
to a sequence exogenously added using terminal
deoxynucleotidyl transferase as described by Loh et
al., Science, 243:217-220 (1989). One or both of
the first and second primers can contain a
nucleotide sequence defining an endonuclease
recognition site. The site can be heterologous to
the immunoglobulin gene being amplified and
typically appears at or near the 5' end of the
primer.
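
The orientation of the two PCR primers relative to the
plus strand can be made concrete with a short Python
sketch. It is a simplified illustration under stated
assumptions: the "plus strand" is a made-up toy
sequence, the site positions and primer length are
arbitrary, exact matching stands in for hybridization,
and the function names are hypothetical.

```python
# Illustrative sketch only; the toy plus strand and names are hypothetical.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


def design_primer_pair(plus_strand: str, upstream_site: int,
                       downstream_site: int, length: int):
    """Return (first_primer, second_primer), both written 5'->3'.

    The first primer hybridizes to the plus strand at the downstream
    conserved site, so it is the reverse complement of that site. The
    second primer hybridizes to the minus strand at the upstream site,
    so it carries the plus-strand sequence itself.
    """
    second = plus_strand[upstream_site:upstream_site + length]
    first = reverse_complement(
        plus_strand[downstream_site:downstream_site + length])
    return first, second


def amplicon(plus_strand: str, first_primer: str, second_primer: str):
    """Return the plus-strand segment bounded by the primers, or None."""
    start = plus_strand.find(second_primer)
    if start == -1:
        return None
    site = reverse_complement(first_primer)
    end = plus_strand.find(site, start + len(second_primer))
    if end == -1:
        return None
    return plus_strand[start:end + len(site)]


# Toy plus strand: upstream conserved site, variable middle,
# downstream conserved site, flanked by a few extra bases.
plus = "A" + "CAGGTGCA" + "TTTTGGGGCCCC" + "GTCTCCTC" + "AG"
first, second = design_primer_pair(plus, 1, 21, 8)
product = amplicon(plus, first, second)
```

In this toy case the amplified segment begins with the
second primer's sequence and ends at the site
complementary to the first primer, mirroring the
upstream (leader or first framework) and downstream
(J or constant region) sites named in the text.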

3. Preparing a Gene Library 
The strategy used for cloning, i.e., 
substantially reproducing, the V H and/or V L genes 
contained within the isolated repertoire will 
depend, as is well known in the art, on the type, 
complexity, and purity of the nucleic acids making 
up the repertoire. Other factors include whether 
or not the genes are to be amplified and/or 
mutagenized. 

In one strategy, the object is to clone
the V H- and/or V L-coding genes from a repertoire
comprised of polynucleotide coding strands, such as
mRNA and/or the sense strand of genomic DNA. If
the repertoire is in the form of double stranded
genomic DNA, it is usually first denatured,
typically by melting, into single strands. The
repertoire is subjected to a first primer
extension reaction by treating (contacting) the
repertoire with a first polynucleotide synthesis
primer having a preselected nucleotide sequence.
The first primer is capable of initiating the first
primer extension reaction by hybridizing to a
nucleotide sequence, preferably at least about 10
nucleotides in length and more preferably at least
about 20 nucleotides in length, conserved within
the repertoire. The first primer is sometimes
referred to herein as the "sense primer" because it
hybridizes to the coding or sense strand of a
nucleic acid. In addition, the second primer is
sometimes referred to herein as the "anti-sense
primer" because it hybridizes to a non-coding or
anti-sense strand of a nucleic acid, i.e., a strand
complementary to a coding strand.

The first primer extension is performed
by mixing the first primer, preferably a
predetermined amount thereof, with the nucleic
acids of the repertoire, preferably a predetermined
amount thereof, to form a first primer extension
reaction admixture. The admixture is maintained
under polynucleotide synthesizing conditions for a
time period, which is typically predetermined,
sufficient for the formation of a first primer
extension reaction product, thereby producing a
plurality of different V H-coding DNA homolog
complements. The complements are then subjected to
a second primer extension reaction by treating them
with a second polynucleotide synthesis primer
having a preselected nucleotide sequence. The
second primer is capable of initiating the second
reaction by hybridizing to a nucleotide sequence,
preferably at least about 10 nucleotides in length
and more preferably at least about 20 nucleotides
in length, conserved among a plurality of different
V H-coding gene complements such as those, for
example, produced by the first primer extension
reaction. This is accomplished by mixing the
second primer, preferably a predetermined amount
thereof, with the complement nucleic acids,
preferably a predetermined amount thereof, to form
a second primer extension reaction admixture. The
admixture is maintained under polynucleotide
synthesizing conditions for a time period, which is
typically predetermined, sufficient for the
formation of a second primer extension reaction
product, thereby producing a gene library
containing a plurality of different V H- and/or
V L-coding DNA homologs.

A plurality of first primers and/or a
plurality of second primers can be used in each
amplification, or an individual pair of first and
second primers can be used. In any case, the
amplification products of amplifications using the
same or different combinations of first and second
primers can be combined to increase the diversity
of the gene library.

In another strategy, the object is to
clone the V H- and/or V L-coding genes from a
repertoire by providing a polynucleotide complement
of the repertoire, such as the anti-sense strand of
genomic dsDNA or the polynucleotide produced by
subjecting mRNA to a reverse transcriptase
reaction. Methods for producing such complements
are well known in the art. The complement is
subjected to a primer extension reaction similar to
the above-described second primer extension
reaction, i.e., a primer extension reaction using a
polynucleotide synthesis primer capable of
hybridizing to a nucleotide sequence conserved
among a plurality of different V H-coding gene
complements.

The primer extension reaction is
performed using any suitable method. Generally it
occurs in a buffered aqueous solution, preferably
at a pH of 7-9, most preferably about 8.
Preferably, a molar excess (for genomic nucleic
acid, usually about 10^6:1 primer:template) of the
primer is admixed to the buffer containing the
template strand. A large molar excess is preferred
to improve the efficiency of the process.
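
The stated molar excess translates directly into the
amount of primer needed for a given amount of template.
The Python sketch below shows the arithmetic; it is
illustrative only, the function names are hypothetical,
and the average mass of about 330 g/mol per nucleotide
of single-stranded DNA is a commonly used approximation
assumed here rather than a figure from this
specification.

```python
# Illustrative sketch only; names and the 330 g/mol figure are assumptions.

AVERAGE_NT_MASS = 330.0  # approximate g/mol per nucleotide, single-stranded DNA


def primer_moles_for_excess(template_moles: float,
                            excess_ratio: float = 1e6) -> float:
    """Moles of primer giving the requested primer:template molar ratio."""
    if template_moles < 0 or excess_ratio <= 0:
        raise ValueError("amounts must be non-negative and ratio positive")
    return template_moles * excess_ratio


def moles_from_mass(mass_grams: float, length_nt: int) -> float:
    """Approximate moles of a single-stranded oligonucleotide from its mass."""
    if length_nt <= 0:
        raise ValueError("length must be positive")
    return mass_grams / (length_nt * AVERAGE_NT_MASS)
```

For example, 1 attomole (10^-18 mol) of template at a
10^6:1 ratio calls for about 1 picomole of primer, which
for a 20-nucleotide primer is roughly 6.6 nanograms
under the assumed average nucleotide mass.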

The deoxyribonucleotide triphosphates
dATP, dCTP, dGTP, and dTTP are also admixed to the
primer extension (polynucleotide synthesis)
reaction admixture in adequate amounts and the
resulting solution is heated to about 90°C - 100°C
for about 1 to 10 minutes, preferably from 1 to 4
minutes. After this heating period the solution is
allowed to cool to room temperature, which is
preferable for primer hybridization. To the cooled
mixture is added an appropriate agent for inducing
or catalyzing the primer extension reaction, and
the reaction is allowed to occur under conditions
known in the art. The synthesis reaction may occur
at from room temperature up to a temperature above
which the inducing agent no longer functions
efficiently. Thus, for example, if DNA polymerase
is used as inducing agent, the temperature is
generally no greater than about 40°C.

The inducing agent may be any compound or
system which will function to accomplish the
synthesis of primer extension products, including
enzymes. Suitable enzymes for this purpose
include, for example, E. coli DNA polymerase I,
Klenow fragment of E. coli DNA polymerase I, T4 DNA
polymerase, other available DNA polymerases,
reverse transcriptase, and other enzymes, including
heat-stable enzymes, which will facilitate
combination of the nucleotides in the proper manner
to form the primer extension products which are
complementary to each nucleic acid strand.
Generally, the synthesis will be initiated at the
3' end of each primer and proceed in the 5'
direction along the template strand, until
synthesis terminates, producing molecules of
different lengths. There may be inducing agents,
however, which initiate synthesis at the 5' end and
proceed in the above direction, using the same
process as described above.

The inducing agent also may be a compound
or system which will function to accomplish the
synthesis of RNA primer extension products,
including enzymes. In preferred embodiments,
the inducing agent may be a DNA-dependent RNA
polymerase such as T7 RNA polymerase, T3 RNA
polymerase or SP6 RNA polymerase. These RNA
polymerases initiate synthesis from a promoter
contained within a primer of the present invention.
These polymerases produce a complementary RNA
polynucleotide. The high turnover rate of the RNA
polymerase amplifies the starting polynucleotide as
has been described by Chamberlin et al., The
Enzymes, ed., P. Boyer, pp. 87-108, Academic Press,
NY (1982). Another advantage of T7 RNA polymerase
is that mutations can be introduced into the
polynucleotide synthesis by replacing a portion of
cDNA with one or more mutagenic
oligodeoxynucleotides (polynucleotides) and
transcribing the partially-mismatched template
directly as has been previously described by Joyce
et al., Nucleic Acids Research, 17:711-722 (1989).

If the inducing agent is a DNA-dependent 
RNA polymerase and therefore incorporates 
ribonucleotide triphosphates, sufficient amounts of
ATP, CTP, GTP, and UTP are admixed to the primer
extension reaction admixture and the resulting 
solution is treated as described above. 

The newly synthesized strand and its 
5 complementary nucleic acid strand form a double- 
stranded molecule which can be used in the 
succeeding steps of the process. 

The first and/or second primer extension
reaction discussed above can advantageously be used
to incorporate into the receptor a preselected
epitope useful in immunologically detecting and/or
isolating a receptor. This is accomplished by
utilizing a first and/or second polynucleotide
synthesis primer or expression vector to
incorporate a predetermined amino acid residue
sequence into the amino acid residue sequence of
the receptor.

After producing VH- and/or VL-coding DNA
homologs for a plurality of different VH- and/or
VL-coding genes within the repertoire, the homologs
are typically amplified. While the VH- and/or
VL-coding DNA homologs can be amplified by classic
techniques such as incorporation into an
autonomously replicating vector, it is preferred to
first amplify the DNA homologs by subjecting them
to a polymerase chain reaction (PCR) prior to
inserting them into a vector. In fact, in
preferred strategies, the first and/or second
primer extension reactions used to produce the gene
library are the first and second primer extension
reactions in a polymerase chain reaction.

PCR is typically carried out by cycling,
i.e., simultaneously performing in one admixture,
the above described first and second primer
extension reactions, each cycle comprising
polynucleotide synthesis followed by denaturation
of the double stranded polynucleotides formed.
Methods and systems for amplifying a DNA homolog
are described in U.S. Patents No. 4,683,195 and
No. 4,683,202, both to Mullis et al.
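
The effect of repeated cycling on yield can be sketched numerically. The following is only an illustration under stated assumptions, not part of the disclosed method: it assumes that each cycle of synthesis and denaturation multiplies the number of template molecules by (1 + efficiency). The function name `pcr_copies` and the `efficiency` parameter are hypothetical and were chosen for this example.

```python
def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Expected number of copies after a given number of PCR cycles.

    Assumption (not from the source text): every cycle multiplies the
    template count by (1 + efficiency), where efficiency = 1.0 means
    perfect doubling and 0.0 means no amplification at all.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


# One starting molecule, ten ideal cycles: 2**10 copies.
print(pcr_copies(1, 10))  # 1024.0
```

Under this idealized model a lower efficiency only changes the base of the exponent, which is why even imperfect cycling still yields exponential amplification.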

In preferred embodiments only one pair of
first and second primers is used per amplification
reaction. The amplification reaction products
obtained from a plurality of different
amplifications, each using a plurality of different
primer pairs, are then combined.

However, the present invention also
contemplates DNA homolog production via
co-amplification (using two pairs of primers), and
multiplex amplification (using up to about 8, 9 or
10 primer pairs).

The VH- and VL-coding DNA homologs
produced by PCR amplification are typically in
double-stranded form and have contiguous or
adjacent to each of their termini a nucleotide
sequence defining an endonuclease restriction site.
Digestion of the VH- and VL-coding DNA homologs
having restriction sites at or near their termini
with one or more appropriate endonucleases results
in the production of homologs having cohesive
termini of predetermined specificity. In preferred
embodiments, the PCR process is used not only to
amplify the VH- and/or VL-coding DNA homologs of the
library, but also to induce mutations within the
library and thereby provide a library having a
greater heterogeneity. First, it should be noted
that the PCR process itself is inherently
mutagenic due to a variety of factors well known in
the art. Second, in addition to the mutation
inducing variations described in the above
referenced U.S. Patent No. 4,683,195, other
mutation inducing PCR variations can be employed.
For example, the PCR reaction admixture, i.e., the
combined first and second primer extension reaction
admixtures, can be formed with different amounts of
one or more of the nucleotides to be incorporated
into the extension product. Under such
conditions, the PCR reaction proceeds to produce
nucleotide substitutions within the extension
product as a result of the scarcity of a particular
base. Similarly, approximately equal molar amounts
of the nucleotides can be incorporated into the
initial PCR reaction admixture in an amount to
efficiently perform X number of cycles, and the
admixture then cycled through a number of cycles in
excess of X, such as, for instance, 2X.
Alternatively, mutations can be induced during the
PCR reaction by incorporating into the reaction
admixture nucleotide derivatives such as inosine,
not normally found in the nucleic acids of the
repertoire being amplified. During subsequent in
vivo amplification, the nucleotide derivative will
be replaced with a substitute nucleotide thereby
inducing a point mutation.

4. Expressing the VH and/or VL DNA Homologs

The VH- and/or VL-coding DNA homologs
contained within the library produced by the
above-described method can be operatively linked to a
vector for amplification and/or expression.

As used herein, the term "vector" refers
to a nucleic acid molecule capable of transporting
between different genetic environments another
nucleic acid to which it has been operatively
linked. One type of preferred vector is an
episome, i.e., a nucleic acid molecule capable of
extra-chromosomal replication. Preferred vectors
are those capable of autonomous replication and/or
expression of nucleic acids to which they are
linked. Vectors capable of directing the
expression of genes to which they are operatively
linked are referred to herein as "expression
vectors".

The choice of vector to which a VH- and/or
VL-coding DNA homolog is operatively linked depends
directly, as is well known in the art, on the
functional properties desired, e.g., replication or
protein expression, and the host cell to be
transformed, these being limitations inherent in
the art of constructing recombinant DNA molecules.

In preferred embodiments, the vector
utilized includes a procaryotic replicon, i.e., a
DNA sequence having the ability to direct
autonomous replication and maintenance of the
recombinant DNA molecule extrachromosomally in a
procaryotic host cell, such as a bacterial host
cell, transformed therewith. Such replicons are
well known in the art. In addition, those
embodiments that include a procaryotic replicon
also include a gene whose expression confers a
selective advantage, such as drug resistance, to a
bacterial host transformed therewith. Typical
bacterial drug resistance genes are those that
confer resistance to ampicillin or tetracycline.

Those vectors that include a procaryotic
replicon can also include a procaryotic promoter
capable of directing the expression (transcription
and translation) of the VH- and/or VL-coding
homologs in a bacterial host cell, such as E. coli,
transformed therewith. A promoter is an expression
control element formed by a DNA sequence that
permits binding of RNA polymerase and transcription
to occur. Promoter sequences compatible with
bacterial hosts are typically provided in plasmid
vectors containing convenient restriction sites
for insertion of a DNA segment of the present
invention. Typical of such vector plasmids are
pUC8, pUC9, pBR322, and pBR329 available from
BioRad Laboratories (Richmond, CA) and pPL and
pKK223 available from Pharmacia (Piscataway, NJ).

Expression vectors compatible with
eucaryotic cells, preferably those compatible with
vertebrate cells, can also be used. Eucaryotic
cell expression vectors are well known in the art
and are available from several commercial sources.
Typically, such vectors are provided containing
convenient restriction sites for insertion of the
desired DNA homolog. Typical of such vectors are
pSVL and pKSV-10 (Pharmacia), pBPV-1/pML2d
(International Biotechnologies, Inc.), and pTDT1
(ATCC, No. 31255).

In preferred embodiments, the eucaryotic
cell expression vectors used include a selection
marker that is effective in a eucaryotic cell,
preferably a drug resistance selection marker. A
preferred drug resistance marker is the gene whose
expression results in neomycin resistance, i.e.,
the neomycin phosphotransferase (neo) gene.
Southern et al., J. Mol. Appl. Genet., 1:327-341
(1982).

The use of retroviral expression vectors
to express the genes of the VH- and/or VL-coding DNA
homologs is also contemplated. As used herein, the
term "retroviral expression vector" refers to a DNA
molecule that includes a promoter sequence derived
from the long terminal repeat (LTR) region of a
retrovirus genome.

In preferred embodiments, the expression
vector is typically a retroviral expression vector
that is preferably replication-incompetent in
eucaryotic cells. The construction and use of
retroviral vectors has been described by Sorge et
al., Mol. Cell. Biol., 4:1730-1737 (1984).

A variety of methods have been developed 
to operatively link DNA to vectors via 
complementary cohesive termini. For instance, 
complementary cohesive termini can be engineered 
into the VH- and/or VL-coding DNA homologs during
the primer extension reaction by use of an 
appropriately designed polynucleotide synthesis 
primer, as previously discussed. The vector, and 
DNA homolog if necessary, is cleaved with a 
restriction endonuclease to produce termini 
complementary to those of the DNA homolog. The 
complementary cohesive termini of the vector and 
the DNA homolog are then operatively linked 
(ligated) to produce a unitary double stranded DNA 
molecule. 

In preferred embodiments, the VH-coding
and VL-coding DNA homologs of diverse libraries are
randomly combined in vitro for polycistronic
expression from individual vectors. That is, a
diverse population of double stranded DNA
expression vectors is produced wherein each vector
expresses, under the control of a single promoter,
one VH-coding DNA homolog and one VL-coding DNA
homolog, the diversity of the population being the
result of different VH- and VL-coding DNA homolog
combinations. Random combination in vitro can be
accomplished using two expression vectors
distinguished from one another by the location on
each of a restriction site common to both.
Preferably the vectors are linear double stranded
DNA, such as a Lambda Zap derived vector as
described herein. In the first vector, the site is
located between a promoter and a polylinker, i.e.,
5' terminal (upstream relative to the direction of
expression) to the polylinker but 3' terminal
(downstream relative to the direction of
expression) to the promoter. In the second vector,
the polylinker is located between a promoter and
the restriction site, i.e., the restriction site is
located 3' terminal to the polylinker, and the
polylinker is located 3' terminal to the promoter.

In preferred embodiments, each of the
vectors defines a nucleotide sequence coding for a
ribosome binding site and a leader, the sequence
being located between the promoter and the
polylinker, but downstream (3' terminal) from the
shared restriction site if that site is between the
promoter and polylinker. Also preferred are
vectors containing a stop codon downstream from the
polylinker, but upstream from any shared
restriction site if that site is downstream from
the polylinker. The first and/or second vector can
also define a nucleotide sequence coding for a
peptide tag. The tag sequence is typically located
downstream from the polylinker but upstream from
any stop codon that may be present. In preferred
embodiments, the vectors contain selectable markers
such that the presence of a portion of that vector,
i.e., a particular lambda arm, can be selected for
or selected against. Typical selectable markers
are well known to those skilled in the art.
Examples of such markers are antibiotic resistance
genes, genetically selectable markers, mutation
suppressors such as amber suppressors and the like.
The selectable markers are typically located
upstream of the promoter and/or downstream of the
second restriction site. In preferred embodiments,
one selectable marker is located upstream of the
promoter on the first vector containing the
VH-coding DNA homologs. A second selectable marker is
located downstream of the second restriction site
on the vector containing the VL-coding DNA homologs.
This second selectable marker may be the same as or
different from the first as long as, when the
VH-coding vectors and the VL-coding vectors are
randomly combined via the first restriction site,
the resulting vectors containing both VH and VL and
both selectable markers can be selected.

Typically the polylinker is a nucleotide
sequence that defines one or more, preferably at
least two, restriction sites, each unique to the
vector and preferably not shared by the other
vector, i.e., if it is on the first vector, it is
not on the second vector. The polylinker
restriction sites are oriented to permit ligation
of VH- or VL-coding DNA homologs into the vector in
the same reading frame as any leader, tag or stop
codon sequence present.

Random combination is accomplished by
ligating VH-coding DNA homologs into the first
vector, typically at a restriction site or sites
within the polylinker. Similarly, VL-coding DNA
homologs are ligated into the second vector,
thereby creating two diverse populations of
expression vectors. It does not matter which type
of DNA homolog, i.e., VH or VL, is ligated to which
vector, but it is preferred, for example, that all
VH-coding DNA homologs are ligated to either the
first or second vector, and all of the VL-coding DNA
homologs are ligated to the other of the first or
second vector. The members of both populations are
then cleaved with an endonuclease at the shared
restriction site, typically by digesting both
populations with the same enzyme. The resulting
product is two diverse populations of restriction
fragments where the members of one have cohesive
termini complementary to the cohesive termini of
the members of the other. The restriction
fragments of the two populations are randomly
ligated to one another, i.e., a random,
interpopulation ligation is performed, to produce a
diverse population of vectors each having a
VH-coding and VL-coding DNA homolog located in the
same reading frame and under the control of the
second vector's promoter. Of course, subsequent
recombinations can be effected through cleavage at
the shared restriction site, which is typically
reformed upon ligation of members from the two
populations, followed by subsequent religations.
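
The diversity generated by random interpopulation ligation can be sketched as follows. This is a hedged illustration, not the disclosed procedure: it assumes each product vector receives one VH-coding and one VL-coding homolog chosen independently and with equal probability, and the names `possible_pairings`, `randomly_combine` and the placeholder clone labels are hypothetical.

```python
import random


def possible_pairings(vh_library, vl_library):
    """Number of distinct VH/VL combinations that two libraries can yield."""
    return len(set(vh_library)) * len(set(vl_library))


def randomly_combine(vh_library, vl_library, n_vectors, seed=0):
    """Simulate random ligation of fragments from the two populations.

    Assumption: every product vector gets one VH and one VL homolog,
    each drawn independently and uniformly from its own library.
    """
    rng = random.Random(seed)  # seeded only so the sketch is repeatable
    return [(rng.choice(vh_library), rng.choice(vl_library))
            for _ in range(n_vectors)]


vh = ["VH-1", "VH-2", "VH-3"]  # placeholder clone names
vl = ["VL-1", "VL-2"]
print(possible_pairings(vh, vl))  # 6
print(randomly_combine(vh, vl, 4))
```

Because the count is a product, two libraries of modest size can in principle yield a far larger combinatorial library than either library alone.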

The resulting construct is then
introduced into an appropriate host to provide
amplification and/or expression of the VH- and/or
VL-coding DNA homologs, either separately or in
combination. When coexpressed within the same
organism, either on the same or different
vectors, a functionally active Fv is produced.
When the VH and VL polypeptides are expressed in
different organisms, the respective polypeptides
are isolated and then combined in an appropriate
medium to form an Fv. Cellular hosts into which a
VH- and/or VL-coding DNA homolog-containing
construct has been introduced are referred to
herein as having been "transformed" or as
"transformants".

The host cell can be either procaryotic
or eucaryotic. Bacterial cells are preferred
procaryotic host cells and typically are a strain
of E. coli such as, for example, the E. coli strain
DH5 available from Bethesda Research Laboratories,
Inc., Bethesda, MD. Preferred eucaryotic host
cells include yeast and mammalian cells, preferably
vertebrate cells such as those from a mouse, rat,
monkey or human cell line.

Transformation of appropriate cell hosts
with a recombinant DNA molecule of the present
invention is accomplished by methods that typically
depend on the type of vector used. With regard to
transformation of procaryotic host cells, see, for
example, Cohen et al., Proceedings National Academy
of Science, USA, Vol. 69, P. 2110 (1972); and
Maniatis et al., Molecular Cloning, a Laboratory
Manual, Cold Spring Harbor Laboratory, Cold Spring
Harbor, NY (1982). With regard to the
transformation of vertebrate cells with retroviral
vectors containing rDNAs, see for example, Sorge et
al., Mol. Cell. Biol., 4:1730-1737 (1984); Graham
et al., Virol., 52:456 (1973); and Wigler et al.,
Proceedings National Academy of Sciences, USA, Vol.
76, P. 1373-1376 (1979).

5. Screening For Expression of VH and/or VL Polypeptides

Successfully transformed cells, i.e.,
cells containing a VH- and/or VL-coding DNA homolog
operatively linked to a vector, can be identified
by any suitable well known technique for detecting
the binding of a receptor to a ligand or the
presence of a polynucleotide coding for the
receptor, preferably its active site. Preferred
screening assays are those where the binding of
ligand by the receptor produces a detectable
signal, either directly or indirectly. Such
signals include, for example, the production of a
complex, formation of a catalytic reaction product,
the release or uptake of energy, and the like. For
example, cells from a population subjected to
transformation with a subject rDNA can be cloned to
produce monoclonal colonies. Cells from those
colonies can be harvested, lysed and their DNA
content examined for the presence of the rDNA using
a method such as that described by Southern, J.
Mol. Biol., 98:503 (1975) or Berent et al.,
Biotech., 3:203 (1985).

In addition to directly assaying for the
presence of a VH- and/or VL-coding DNA homolog,
successful transformation can be confirmed by well
known immunological methods, especially when the VH
and/or VL polypeptides produced contain a
preselected epitope. For example, samples of cells
suspected of being transformed are assayed for the
presence of the preselected epitope using an
antibody against the epitope.

6. VH- And/Or VL-Coding Gene Libraries

The present invention contemplates a gene
library, preferably produced by a primer extension
reaction or combination of primer extension
reactions as described herein, containing at least
about 10^3, preferably at least about 10^4 and more
preferably at least about 10^5 different VH- and/or
VL-coding DNA homologs. The homologs are preferably
in an isolated form, that is, substantially free of
materials such as, for example, primer extension
reaction agents and/or substrates, genomic DNA
segments, and the like.

In preferred embodiments, a substantial 
portion of the homologs present in the library are 
operatively linked to a vector, preferably 
operatively linked for expression to an expression 
vector. 

Preferably, the homologs are present in a 
medium suitable for in vitro manipulation, such as 
water, water containing buffering salts, and the 
like. The medium should be compatible with 
maintaining the biological activity of the 
homologs. In addition, the homologs should be 
present at a concentration sufficient to allow 
transformation of a host cell compatible therewith 
at reasonable frequencies. 

It is further preferred that the homologs 
be present in compatible host cells transformed 
therewith. 

D. Expression vectors 

The present invention also contemplates 
various expression vectors useful in performing, 
inter alia, the methods of the present invention.
Each of the expression vectors is a novel
derivative of Lambda Zap.

1. Lambda Zap II

Lambda Zap II is prepared by
replacing the Lambda S gene of the vector Lambda
Zap with the Lambda S gene from the Lambda gt10
vector, as described in Example 6.

2. Lambda Zap II VH

Lambda Zap II VH is prepared by
inserting the synthetic DNA sequences illustrated
in Figure 6A into the above-described Lambda Zap II
vector. The inserted nucleotide sequence
advantageously provides a ribosome binding site
(Shine-Dalgarno sequence) to permit proper
initiation of mRNA translation into protein, and a
leader sequence to efficiently direct the
translated protein to the periplasm. The
preparation of Lambda Zap II VH is described in more
detail in Example 9, and its features illustrated
in Figures 6A and 7.

3. Lambda Zap II VL

Lambda Zap II VL is prepared as
described in Example 12 by inserting into Lambda
Zap II the synthetic DNA sequence illustrated in
Figure 6B. Important features of Lambda Zap II VL
are illustrated in Figure 8.

Lambda Zap II VL II is prepared as
described in Example 11 by inserting into Lambda
Zap II the synthetic DNA sequence illustrated in
Figure 10.

The above-described vectors are
compatible with E. coli hosts, i.e., they can
express for secretion into the periplasm proteins
coded for by genes to which they have been
operatively linked for expression.

Examples

The following examples are intended to
illustrate, but not limit, the scope of the
invention.

1. Polynucleotide Selection

The nucleotide sequences encoding the
immunoglobulin protein CDR's are highly variable.
However, there are several regions of conserved
sequences that flank the VH domains. For instance,
contain substantially conserved nucleotide
sequences, i.e., sequences that will hybridize to
the same primer sequence. Therefore,
polynucleotide synthesis (amplification) primers
that hybridize to the conserved sequences and
incorporate restriction sites into the DNA homolog
produced that are suitable for operatively linking
the synthesized DNA fragments to a vector were
constructed. More specifically, the DNA homologs
were inserted into Lambda ZAP II vector (Stratagene
Cloning System, San Diego, CA) at the Xho I and
EcoR I sites. For amplification of the VH domains,
the 3' primer (primer 12 in Table 1) was designed
to be complementary to the mRNA in the JH region.
In all cases, the 5' primers (primers 1-10, Table
1) were chosen to be complementary to the first
strand cDNA in the conserved N-terminus region
(antisense strand). Initially amplification was
performed with a mixture of 32 primers (primer 1,
Table 1) that were degenerate at five positions.
Hybridoma mRNA could be amplified with mixed
primers, but initial attempts to amplify mRNA from
spleen yielded variable results. Therefore,
several alternatives to amplification using the
mixed 5' primers were compared.
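
How a single degenerate primer corresponds to a mixture of 32 sequences can be sketched as below. This is an illustration only: the sequence `EXAMPLE_PRIMER` is invented for the example and is not primer 1 of Table 1, and the function name `expand_degenerate` is hypothetical. The sketch assumes each degenerate position is written with a standard IUPAC ambiguity code and that the five degenerate positions each allow two nucleotides, so the mixture has 2^5 = 32 members.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer: str) -> list:
    """List every unique sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*choices)]


# Hypothetical primer with five two-fold degenerate positions (S, M, R, K, W).
EXAMPLE_PRIMER = "AASCMTRGKAWC"

mixture = expand_degenerate(EXAMPLE_PRIMER)
print(len(mixture))  # 32
```

Fixing any degenerate position to a single nucleotide halves the size of the mixture, which is one way to trade coverage for more uniform hybridization.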

The first alternative was to construct
multiple unique primers, eight of which are shown
in Table 1, corresponding to individual members of
the mixed primer pool. The individual primers 2-9
of Table 1 were constructed by incorporating either
of the two possible nucleotides at three of the
five degenerate positions.

The second alternative was to construct a
primer containing inosine (primer 10, Table 1) at
four of the variable positions based on the
published work of Takahashi et al., Proc. Natl.
Acad. Sci. (U.S.A.), 82:1931-1935 (1985) and
Ohtsuka et al., J. Biol. Chem., 260:2605-2608
(1985). This primer has the advantage that it is
not degenerate and, at the same time, minimizes the
negative effects of mismatches at the unconserved
positions as discussed by Martin et al., Nuc. Acids
Res., 13:8927 (1985). However, it was not known if
the presence of inosine nucleotides would result in
incorporation of unwanted sequences in the cloned VH
regions. Therefore, inosine was not included at
the one position that remains in the amplified
fragments after the cleavage of the restriction
sites. As a result, inosine was not in the cloned
insert.

Additional VH amplification primers,
including the unique 3' primer, were designed to be
complementary to a portion of the first constant
region domain of the gamma 1 heavy chain mRNA
(primers 15 and 16, Table 1). These primers will
produce DNA homologs containing polynucleotides
coding for amino acids from the VH and the first
constant region domains of the heavy chain. These
DNA homologs can therefore be used to produce Fab
fragments rather than an Fv.

As a control for amplification from
spleen or hybridoma mRNA, a set of primers
hybridizing to a highly conserved region within the
constant region IgG, heavy chain gene was
constructed. The 5' primer (primer 11, Table 1) is
complementary to the cDNA in the CH2 region whereas
the 3' primer (primer 13, Table 1) is complementary
to the mRNA in the CH3 region. It is believed that
no mismatches were present between these primers
and their templates.

The nucleotide sequences encoding the VL
CDRs are highly variable. However, there are
several regions of conserved sequences that flank
the VL CDR domains including the JL, VL framework
regions and VL leader/promoter. Therefore,
amplification primers that hybridize to the
conserved sequences and incorporate restriction
sites that allow cloning the amplified fragments
into the pBluescript SK- vector cut with Nco I and
Spe I were constructed. For amplification of the VL
CDR domains, the 3' primer (primer 14 in Table 1)
was designed to be complementary to the mRNA in the
JL regions. The 5' primer (primer 15, Table 1) was
chosen to be complementary to the first strand cDNA
in the conserved N-terminus region (antisense
strand).
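
The two primer orientations described above can be sketched with a reverse-complement helper. This is an assumption-laden illustration: the sequence `MRNA_SEGMENT` is invented and is not taken from Table 1 or Table 2, RNA is written with DNA letters for simplicity, and the names used are hypothetical. A primer complementary to the mRNA is the reverse complement of the mRNA segment, whereas a primer complementary to the first strand cDNA (the antisense strand) has the same sequence as the corresponding mRNA segment.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# Hypothetical stretch of mRNA, written 5'->3' with DNA letters.
MRNA_SEGMENT = "ACCGTCTCCTCA"

# A 3' primer complementary to the mRNA anneals to it directly.
three_prime_primer = reverse_complement(MRNA_SEGMENT)

# First strand cDNA is complementary to the mRNA, so a 5' primer that is
# complementary to the cDNA reads the same as the mRNA segment itself.
first_strand_cdna = reverse_complement(MRNA_SEGMENT)
five_prime_primer = reverse_complement(first_strand_cdna)

print(three_prime_primer)  # TGAGGAGACGGT
print(five_prime_primer == MRNA_SEGMENT)  # True
```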

In a second set of amplification primers for
amplification of the VL CDR domains, the 5' primers
(primers 1-8 in Table 2) were designed to be
complementary to the first strand cDNA in the
conserved N-terminus region. These primers also
introduced a Sac I restriction endonuclease site to
allow the VL DNA homolog to be cloned into the VL II
expression vector. The 3' VL amplification primer
(primer 9 in Table 2) was designed to be
complementary to the mRNA in the JL regions and to
introduce the Xba I restriction endonuclease site
required to insert the VL DNA homolog into the VL II
expression vector (Figure a).

Additional 3' VL amplification primers
were designed to hybridize to the constant region
of either kappa or lambda mRNA (primers 10 and 11
in Table 2). These primers allow a DNA homolog to
be produced containing polynucleotide sequences
coding for constant region amino acids of either
kappa or lambda chain. These primers make it
possible to produce an Fab fragment rather than an
Fv.

The primers used for amplification of
kappa light chain sequences for construction of
Fabs are shown at least in Table 2. Amplification
with these primers was performed in 5 separate
reactions, each containing one of the 5' primers
(primers 3-6, and 12) and one of the 3' primers
(primer 13). The remaining 3' primer (primer 9)
has been used to construct Fv fragments. The 5'
primers contain a Sac I restriction site and the 3'
primers contain a Xba I restriction site.

The primers used for amplification of
heavy chain Fd fragments for construction of Fabs
are shown at least in Table 1. Amplification was
performed in eight separate reactions, each
containing one of the 5' primers (primers 2-9) and
one of the 3' primers (primer 15). The remaining
5' primers that have been used for amplification in
a single reaction are either a degenerate primer
(primer 1) or a primer that incorporates inosine at
four degenerate positions (primer 10, Table 1, and
primers 17 and 18, Table 2). The remaining 3'
primer (primer 14, Table 2) has been used to
construct Fv fragments. Many of the 5' primers
incorporate a Xho I site, and the 3' primers
include a Spe I restriction site.
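
Whether a candidate primer carries one of the restriction sites named in this example can be checked with a short sketch. The recognition sequences below are the standard published ones for these enzymes and are not stated in the text itself; the primer passed to the function is invented for illustration and is not a primer from Table 1 or Table 2, and the name `find_sites` is hypothetical.

```python
# Standard recognition sequences (5'->3') for the enzymes named in the text.
RESTRICTION_SITES = {
    "Xho I": "CTCGAG",
    "Spe I": "ACTAGT",
    "Sac I": "GAGCTC",
    "Xba I": "TCTAGA",
    "EcoR I": "GAATTC",
    "Nco I": "CCATGG",
}


def find_sites(sequence: str) -> dict:
    """Map each enzyme whose site occurs in `sequence` to its 0-based start positions."""
    seq = sequence.upper()
    hits = {}
    for enzyme, site in RESTRICTION_SITES.items():
        positions = [i for i in range(len(seq) - len(site) + 1)
                     if seq.startswith(site, i)]
        if positions:
            hits[enzyme] = positions
    return hits


# Hypothetical 5' primer fragment carrying an Xho I site.
print(find_sites("GGCTCGAGTCAGG"))  # {'Xho I': [2]}
```

All six sites are palindromic, so scanning one strand is sufficient for this particular set of enzymes.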

VL amplification primers designed to
amplify human light chain variable regions of both
the lambda and kappa isotypes are also shown in
Table 2.

All primers and synthetic polynucleotides
used herein and shown on Tables 1-4 were either
purchased from Research Genetics in Huntsville,
Alabama or synthesized on an Applied Biosystems DNA
synthesizer, model 381A, using the manufacturer's
instructions.

2. Production Of A VH Coding Repertoire Enriched In FITC Binding Proteins

Fluorescein isothiocyanate (FITC) was
selected as a ligand for receptor binding. It was
further decided to enrich by immunization the
immunological gene repertoire, i.e., VH- and
VL-coding gene repertoires, for genes coding for
anti-FITC receptors. This was accomplished by linking
FITC to keyhole limpet hemocyanin (KLH) using the
techniques described in Antibodies A Laboratory
Manual, Harlow and Lane, eds., Cold Spring Harbor,
New York, (1988). Briefly, 10.0 milligrams (mg) of
keyhole limpet hemocyanin and 0.5 mg of FITC were
added to 1 ml of buffer containing 0.1 M sodium
carbonate at pH 9.6 and stirred for 18 to 24 hours
at 4 degrees C (4C). The unbound FITC was removed
by gel filtration through Sephadex G-25.

The KLH-FITC conjugate was prepared for
injection into mice by adding 100 ug of the
conjugate to 250 ul of phosphate buffered saline
(PBS). An equal volume of complete Freund's
adjuvant was added and the entire solution was
emulsified for 5 minutes. A 129 GIX+ mouse was
injected with 300 ul of the emulsion. Injections
were given subcutaneously at several sites using a
21 gauge needle. A second immunization with
KLH-FITC was given two weeks later. This injection was
prepared as follows: fifty ug of KLH-FITC were
diluted in 250 ul of PBS and an equal volume of
alum was admixed to the KLH-FITC solution. The
mouse was injected intraperitoneally with 500 ul of
the solution using a 23 gauge needle. One month
later the mice were given a final injection of 50
ug of the KLH-FITC conjugate diluted to 200 ul in
PBS. This injection was given intravenously in the
lateral tail vein using a 30 gauge needle. Five
days after this final injection the mice were
sacrificed and total cellular RNA was isolated from
their spleens.

Hybridoma PCP 8B11 producing an antibody
immunospecific for phosphonate ester was cultured
in DMEM media (Gibco Laboratories, Grand Island,
New York) containing 10 percent fetal calf serum
supplemented with penicillin and streptomycin.
About 5 x 10^8 hybridoma cells were harvested and
washed twice in phosphate buffered saline. Total
cellular RNA was prepared from these isolated
hybridoma cells.

15 3. Preparation Of A Vu-Codinq Gene 

Repertoire 
Total cellular RNA was prepared from the 
spleen of a single mouse immunized with KLH-FITC as 
described in Example 2 using the RNA preparation 

methods described by Chomczynski et al., Anal. 
Biochem., 162:156-159 (1987) and using the 
manufacturer's instructions and the RNA isolation 
kit produced by Stratagene Cloning Systems, La 
Jolla, CA. Briefly, immediately after removing the 
spleen from the immunized mouse, the tissue was 
homogenized in 10 ml of a denaturing solution 
containing 4.0 M guanidine isothiocyanate, 0.25 M 
sodium citrate at pH 7.0, and 0.1 M 
2-mercaptoethanol using a glass homogeniser. One ml 
of sodium acetate at a concentration of 2 M at pH 
4.0 was admixed with the homogenized spleen. One 
ml of phenol that had been previously saturated 
with H2O was also admixed to the denaturing solution 
containing the homogenized spleen. Two ml of a 
chloroform:isoamyl alcohol (24:1 v/v) mixture was 
added to this homogenate. The homogenate was mixed 
vigorously for ten seconds and maintained on ice 
for 15 minutes. The homogenate was then 
transferred to a thick-walled 50 ml polypropylene 
centrifuge tube (Fisher Scientific Company, 
Pittsburgh, PA). The solution was centrifuged at 
10,000 x g for 20 minutes at 4C. The upper 
RNA-containing aqueous layer was transferred to a fresh 
50 ml polypropylene centrifuge tube and mixed with 
an equal volume of isopropyl alcohol. This 
solution was maintained at -20C for at least one 
hour to precipitate the RNA. The solution 
containing the precipitated RNA was centrifuged at 
10,000 x g for twenty minutes at 4C. The pelleted 
total cellular RNA was collected and dissolved in 3 
ml of the denaturing solution described above. 
Three ml of isopropyl alcohol was added to the 
resuspended total cellular RNA and vigorously 
mixed. This solution was maintained at -20C for at 
least 1 hour to precipitate the RNA. The solution 
containing the precipitated RNA was centrifuged at 
10,000 x g for ten minutes at 4C. The pelleted RNA 
was washed once with a solution containing 75% 
ethanol. The pelleted RNA was dried under vacuum 
for 15 minutes and then resuspended in diethyl 
pyrocarbonate (DEPC) treated H2O (DEPC-H2O). 

Messenger RNA (mRNA) enriched for 
sequences containing long poly A tracts was 
prepared from the total cellular RNA using methods 
described in Molecular Cloning: A Laboratory Manual, 
Maniatis et al., eds., Cold Spring Harbor 
Laboratory, NY, (1982). Briefly, one half of the 
total RNA isolated from a single immunized mouse 
spleen prepared as described above was resuspended 
in one ml of DEPC-H2O and maintained at 65C for five 
minutes. One ml of 2x high salt loading buffer 
consisting of 100 mM Tris-HCl, 1 M sodium chloride, 
2.0 mM disodium ethylene diamine tetraacetic acid 
(EDTA) at pH 7.5, and 0.2% sodium dodecyl sulfate 
(SDS) was added to the resuspended RNA and the 
mixture allowed to cool to room temperature. The 
mixture was then applied to an oligo-dT 
(Collaborative Research Type 2 or Type 3) column 
that was previously prepared by washing the 
oligo-dT with a solution containing 0.1 M sodium 
hydroxide and 5 mM EDTA and then equilibrating the 
column with DEPC-H2O. The eluate was collected in a 
sterile polypropylene tube and reapplied to the 
same column after heating the eluate for 5 minutes 
at 65C. The oligo dT column was then washed with 2 
ml of high salt loading buffer consisting of 50 mM 
Tris-HCl at pH 7.5, 500 mM sodium chloride, 1 mM 
EDTA at pH 7.5 and 0.1% SDS. The oligo dT column 
was then washed with 2 ml of 1 X medium salt buffer 
consisting of 50 mM Tris-HCl at pH 7.5, 100 mM 
sodium chloride, 1 mM EDTA and 0.1% SDS. The 
messenger RNA was eluted from the oligo dT column 
with 1 ml of buffer consisting of 10 mM Tris-HCl at 
pH 7.5, 1 mM EDTA at pH 7.5 and 0.05% SDS. The 
messenger RNA was purified by extracting this 
solution with phenol/chloroform followed by a 
single extraction with 100% chloroform. The 
messenger RNA was concentrated by ethanol 
precipitation and resuspended in DEPC-H2O. 
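
The loading step above mixes one ml of RNA in water with one ml of the 2x high salt loading buffer, which should bring the sample to the same composition as the 1x high salt buffer used for the first wash. A minimal sketch of that dilution arithmetic follows. It assumes the RNA sample contributes no buffer components of its own, and the function and variable names are illustrative only.

```python
# Component concentrations of the 2x high salt loading buffer, as given in the text.
TWO_X_BUFFER = {
    "Tris-HCl (mM)": 100.0,
    "NaCl (mM)": 1000.0,
    "EDTA (mM)": 2.0,
    "SDS (%)": 0.2,
}


def dilute(buffer, buffer_volume_ml, sample_volume_ml):
    """Final concentrations after mixing a buffer with a buffer-free sample.

    Assumption: the sample (RNA in DEPC-treated water) adds volume only.
    """
    total_volume = buffer_volume_ml + sample_volume_ml
    factor = buffer_volume_ml / total_volume
    return {name: conc * factor for name, conc in buffer.items()}


# One ml of 2x buffer added to one ml of resuspended RNA.
final = dilute(TWO_X_BUFFER, buffer_volume_ml=1.0, sample_volume_ml=1.0)
for name, conc in final.items():
    print(f"{name}: {conc:g}")
```

Under these assumptions the mixture comes out at 50 mM Tris-HCl, 500 mM sodium chloride, 1 mM EDTA and 0.1% SDS, matching the high salt wash buffer described in the same paragraph.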

The messenger RNA isolated by the above 
process contains a plurality of different V H -coding 
polynucleotides, i.e., greater than about 10^4 
different V H -coding genes. 

4. Preparation Of A Single V H -Coding Polynucleotide 
Polynucleotides coding for a single V H 
were isolated according to Example 3 except total 
cellular RNA was extracted from monoclonal 
hybridoma cells prepared in Example 2. The 
polynucleotides isolated in this manner code for a 
single V H . 

5. DNA Homolog Preparation 
In preparation for PCR amplification, 
mRNA prepared according to the above examples was 
used as a template for cDNA synthesis by a primer 
extension reaction. In a typical 50 ul 
transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first hybridized 
(annealed) with 500 ng (50.0 pmol) of the 3' V H 
primer (primer 12, Table 1) , at 65C for five 
minutes. Subsequently, the mixture was adjusted to 
1.5 mM dATP, dCTP, dGTP and dTTP, 40 mM Tris-HCl at 
pH 8.0, 8 mM MgCl2, 50 mM NaCl, and 2 mM spermidine. 
Moloney murine leukemia virus reverse transcriptase 
(Stratagene Cloning Systems), 26 units, was added 
and the solution was maintained for 1 hour at 37C. 
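
The annealing step above gives the primer amount both as a mass (500 ng) and as a molar amount (50.0 pmol). The sketch below shows how the two can be related. It rests on two assumptions that are not in the text: an average mass of about 330 g/mol per nucleotide of single-stranded DNA, and a hypothetical primer length of 30 nucleotides (the actual primer sequences are in Table 1, which is not part of this section).

```python
# Approximate average molar mass of one nucleotide in single-stranded DNA.
# This is a rule-of-thumb value, not a figure taken from the text.
AVG_NT_MASS_G_PER_MOL = 330.0


def ng_to_pmol(mass_ng, length_nt, nt_mass=AVG_NT_MASS_G_PER_MOL):
    """Convert an oligonucleotide mass in ng to pmol.

    ng / (g/mol) gives nmol, so multiply by 1000 to express the result in pmol.
    """
    molar_mass = length_nt * nt_mass  # g/mol for the whole oligonucleotide
    return mass_ng / molar_mass * 1000.0


# Hypothetical 30-nucleotide primer; the real length is an assumption here.
primer_pmol = ng_to_pmol(mass_ng=500.0, length_nt=30)
print(f"500 ng of a 30-mer is roughly {primer_pmol:.1f} pmol")
```

With these assumed values the result is close to the 50 pmol quoted for 500 ng, which suggests a primer of about 30 nucleotides.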

PCR amplification was performed in a 100 
ul reaction containing the products of the reverse 
transcription reaction (approximately 5 ug of the 
cDNA/RNA hybrid), 300 ng of 3' V H primer (primer 12 
of Table 1), 300 ng each of the 5' V H primers 
(primers 2-10 of Table 1), 200 mM of a mixture of 
dNTP's, 50 mM KCl, 10 mM Tris-HCl pH 8.3, 15 mM 
MgCl2, 0.1% gelatin and 2 units of Taq DNA 
polymerase. The reaction mixture was overlaid with 
mineral oil and subjected to 40 cycles of 
amplification. Each amplification cycle involved 
denaturation at 92C for 1 minute, annealing at 52C 
for 2 minutes and polynucleotide synthesis by 
primer extension (elongation) at 72C for 1.5 
minutes. The amplified V H -coding DNA 
homolog-containing samples were extracted twice with 
phenol/chloroform, once with chloroform, ethanol 
precipitated and were stored at -70C in 10 mM 
Tris-HCl (pH 7.5) and 1 mM EDTA. 
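
The amplification program described above can be written down as a small data structure, which makes the total hold time easy to check. This is only a sketch of the stated cycle (92C for 1 minute, 52C for 2 minutes, 72C for 1.5 minutes, 40 cycles). The class and function names are illustrative, and ramp times between temperatures are ignored because the text does not give them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    minutes: float


# One amplification cycle, with values taken from the text.
CYCLE = (
    Step("denaturation", 92, 1.0),
    Step("annealing", 52, 2.0),
    Step("elongation", 72, 1.5),
)
N_CYCLES = 40


def hold_time_minutes(cycle, n_cycles):
    """Total time spent at the programmed temperatures (ramping excluded)."""
    return n_cycles * sum(step.minutes for step in cycle)


total_minutes = hold_time_minutes(CYCLE, N_CYCLES)
print(f"{N_CYCLES} cycles: {total_minutes:g} min ({total_minutes / 60:g} h) of hold time")
```

Each cycle holds for 4.5 minutes, so the 40 cycles account for 180 minutes before any ramping overhead.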

Using unique 5' primers (2-9, Table 1), 
efficient V H -coding DNA homolog synthesis and 
amplification from the spleen mRNA was achieved as 
shown in Figure 3, lanes R17-R24. The amplified 
cDNA (V H -coding DNA homolog) is seen as a major band 
of the expected size (360 bp). The intensities of 
the amplified V H -coding polynucleotide fragment in 
each reaction appear to be similar, indicating that 
all of these primers are about equally efficient in 
initiating amplification. The yield and quality of 
the amplification with these primers was 
reproducible. The primer containing inosine also 
synthesized amplified V H -coding DNA homologs from 
spleen mRNA reproducibly, leading to the production 
of the expected sized fragment, of an intensity 
similar to that of the other amplified cDNAs 
(Figure 4, lane R16). This result indicated that 
the presence of inosine also permits efficient DNA 
homolog synthesis and amplification, clearly 
indicating how useful such primers are in 
generating a plurality of V H -coding DNA homologs. 
Amplification products obtained from the constant 
region primers (primers 11 and 13, Table 1) were 
more intense, indicating that amplification was more 
efficient, possibly because of a higher degree of 
homology between the template and primers (Figure 
4, lane R9). Based on these results, a V H -coding 
gene library was constructed from the products of 
eight amplifications, each performed with a 
different 5' primer. Equal portions of the 
products from each primer extension reaction were 
mixed and the mixed product was then used to 
generate a library of V H -coding DNA 
homolog-containing vectors. 

DNA homologs of the V L were prepared from 
the purified mRNA prepared as described above. In 
preparation for PCR amplification, mRNA prepared 
according to the above examples was used as a 
template for cDNA synthesis. In a typical 50 ul 
transcription reaction, 5-10 ug of spleen or 
hybridoma mRNA in water was first annealed with 300 
ng (50.0 pmol) of the 3' V L primer (primer 14, Table 
1), at 65C for five minutes. Subsequently, the 
mixture was adjusted to 1.5 mM dATP, dCTP, dGTP, 
and dTTP, 40 mM Tris-HCl at pH 8.0, 8 mM MgCl2, 50 
mM NaCl, and 2 mM spermidine. Moloney murine 
leukemia virus reverse transcriptase (Stratagene 
Cloning Systems), 26 units, was added and the 
solution was maintained for 1 hour at 37C. The PCR 
amplification was performed in a 100 ul reaction 
containing approximately 5 ug of the cDNA/RNA 
hybrid produced as described above, 300 ng of the 
3' V L primer (primer 14 of Table 1), 300 ng of the 
5' V L primer (primer 15 of Table 1), 200 mM of a 
mixture of dNTP's, 50 mM KCl, 10 mM Tris-HCl pH 
8.3, 15 mM MgCl2, 0.1% gelatin and 2 units of Taq 
DNA polymerase. The reaction mixture was overlaid 
with mineral oil and subjected to 40 cycles of 
amplification. Each amplification cycle involved 
denaturation at 92C for 1 minute, annealing at 52C 
for 2 minutes and elongation at 72C for 1.5 
minutes. The amplified samples were extracted 
twice with phenol/chloroform, once with chloroform, 
ethanol precipitated and were stored at -70C in 10 
mM Tris-HCl at pH 7.5 and 1 mM EDTA. 

6. Inserting DNA Homologs Into Vectors 
In preparation for cloning a library 
enriched in V H sequences, PCR amplified products 
(2.5 ug/30 ul of 150 mM NaCl, 8 mM Tris-HCl (pH 
7.5), 6 mM MgSO4, 1 mM DTT, 200 ug/ml bovine serum 
albumin (BSA), at 37C) were digested with restriction 
enzymes Xho I (125 units) and EcoR I (10 units) and 
purified on a 1% agarose gel. In cloning 
experiments which required a mixture of the 
products of the amplification reactions, equal 
volumes (50 ul, 1-10 ug concentration) of each 
reaction mixture were combined after amplification 
but before restriction digestion. After gel 
electrophoresis of the digested PCR amplified 
spleen mRNA, the region of the gel containing DNA 
fragments of approximately 350 bps was excised, 
electroeluted into a dialysis membrane, ethanol 
precipitated and resuspended in 10 mM Tris-HCl pH 
7.5 and 1 mM EDTA to a final concentration of 10 
ng/ul. Equimolar amounts of the insert were then 
ligated overnight at 5C to 1 ug of Lambda ZAP™ II 
vector (Stratagene Cloning Systems, La Jolla, CA) 
previously cut by EcoR I and Xho I. A portion of 
the ligation mixture (1 ul) was packaged for 2 
hours at room temperature using Gigapack Gold 
packaging extract (Stratagene Cloning Systems, La 
Jolla, CA), and the packaged material was plated on 
XL1-Blue host cells. The library was determined to 
consist of 2 x 10^7 V H homologs with less than 30% 
non-recombinant background. 
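
The ligation above calls for an equimolar amount of the roughly 350 bp insert relative to 1 ug of vector. For double-stranded DNA the mass of an equimolar amount scales with length, so the insert mass can be estimated as sketched below. The vector length used here (40,000 bp) is a hypothetical round figure chosen for illustration, since the text does not state the size of the vector, and the helper name is likewise illustrative.

```python
def equimolar_insert_ng(vector_ng, vector_bp, insert_bp):
    """Mass of insert (ng) giving a 1:1 molar ratio with the vector.

    For double-stranded DNA, moles are proportional to mass / length,
    so equal moles means insert_ng / insert_bp == vector_ng / vector_bp.
    """
    return vector_ng * insert_bp / vector_bp


VECTOR_BP_ASSUMED = 40_000    # hypothetical vector length, not from the text
INSERT_BP = 350               # approximate fragment size given in the text
INSERT_STOCK_NG_PER_UL = 10.0 # insert concentration given in the text

insert_ng = equimolar_insert_ng(1000.0, VECTOR_BP_ASSUMED, INSERT_BP)
insert_ul = insert_ng / INSERT_STOCK_NG_PER_UL
print(f"about {insert_ng:.2f} ng of insert, i.e. {insert_ul:.3f} ul of the 10 ng/ul stock")
```

Under the assumed vector length, 1 ug of vector pairs with only a few nanograms of insert, which is why the insert is kept at a dilute 10 ng/ul working concentration.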

The vector used above, Lambda Zap II, is a 
derivative of the original Lambda Zap (ATCC # 
40,298) that maintains all of the characteristics 
of the original Lambda Zap including 6 unique 
cloning sites, fusion protein expression, and the 
ability to rapidly excise the insert in the form of 
a phagemid (Bluescript SK-), but lacks the Sam 100 
mutation, allowing growth on many non-Sup F 
strains, including XL1-Blue. The Lambda Zap II was 
constructed as described in Short et al., Nucleic 
Acids Res., 16:7583-7600, 1988, by replacing the 
Lambda S gene contained in a 4254 base pair (bp) 
DNA fragment produced by digesting Lambda Zap with 
the restriction enzyme Nco I. This 4254 bp DNA 
fragment was replaced with the 4254 bp DNA fragment 
containing the Lambda S gene isolated from Lambda 
gt10 (ATCC # 40,179) after digesting the vector 
with the restriction enzyme Nco I. The 4254 bp DNA 
fragment isolated from Lambda gt10 was ligated into 
the original Lambda Zap vector using T4 DNA ligase 
and standard protocols for such procedures 
described in Current Protocols in Molecular 
Biology, Ausubel et al., eds., John Wiley and Sons, 
New York, 1987. 

In preparation for cloning a library 
enriched in V L sequences, 2 ug of PCR amplified 
products (2.5 ug/30 ul of 150 mM NaCl, 8 mM 
Tris-HCl (pH 7.5), 6 mM MgSO4, 1 mM DTT, 200 ug/ml BSA, 
at 37C) were digested with restriction enzymes Nco I 
(30 units) and Spe I (45 units). The digested PCR 
amplified products were purified on a 1% agarose 
gel using the standard electroelution technique 
described in Molecular Cloning: A Laboratory Manual, 
Maniatis et al., eds., Cold Spring Harbor, New 
York, (1982). Briefly, after gel electrophoresis of 
the digested PCR amplified product the region of 
the gel containing the V L -coding DNA fragment of the 
appropriate size was excised, electroeluted into a 
dialysis membrane, ethanol precipitated and 
resuspended at a final concentration of 10 ng per 
ml in a solution containing 10 mM Tris-HCl at pH 
7.5 and 1 mM EDTA. 

An equal molar amount of DNA representing 
a plurality of different V L -coding DNA homologs was 
ligated to a pBluescript SK- phagemid vector that 
had been previously cut with Nco I and Spe I. A 
portion of the ligation mixture was transformed 
using the manufacturer's instructions into Epicurian 
Coli XL1-Blue competent cells (Stratagene Cloning 
Systems, La Jolla, CA). The transformant library 
was determined to consist of 1.2 x 10^3 colony 
forming units/ug of V L homologs with less than 3% 
non-recombinant background. 

7. Sequencing Of Plasmids From The V H -Coding cDNA Library 

To analyze the Lambda Zap II phage clones 
the clones were excised from Lambda Zap into 
plasmids according to the manufacturer's 
instructions (Stratagene Cloning Systems, La Jolla, 
CA). Briefly, phage plaques were cored from the 
agar plates and transferred to sterile microfuge 
tubes containing 500 ul of a buffer containing 50 mM 
Tris-HCl at pH 7.5, 100 mM NaCl, 10 mM MgSO4, and 
0.01% gelatin and 20 ul of chloroform. 

For excisions, 200 ul of the phage stock, 
200 ul of XL1-Blue cells (A600 = 1.00) and 1 ul of 
R408 helper phage (1 x 10^11 pfu/ml) were incubated 
at 37C for 15 minutes. The excised plasmids were 
infected into XL1-Blue cells and plated onto LB 
plates containing ampicillin. Double stranded DNA 
was prepared from the phagemid containing cells 
according to the methods described by Holmes et 
al., Anal. Biochem., 114:193, (1981). Clones were 
first screened for DNA inserts by restriction 
digests with either Pvu II or Bgl I and clones 
containing the putative V H insert were sequenced 
using reverse transcriptase according to the 
general method described by Sanger et al., Proc. 
Natl. Acad. Sci. USA, 74:5463-5467, (1977) and the 
specific modifications of this method provided in 
the manufacturer's instructions in the AMV reverse 
transcriptase 35S-dATP sequencing kit from 
Stratagene Cloning Systems, La Jolla, CA. 

8. Characterization Of The Cloned V H Repertoire 

The amplified products which had been 
digested with Xho I and EcoR I and cloned into 
Lambda Zap resulted in a cDNA library with 9.0 x 
10^5 pfu's. In order to confirm that the library 
consisted of a diverse population of V H -coding DNA 
homologs, the N-terminal 120 bases of 18 clones, 
selected at random from the library, were excised 
and sequenced (Figure 5). To determine if the 
clones were of V H gene origin, the cloned sequences 
were compared with known V H sequences and V L 
sequences. The clones exhibited from 80 to 90% 
homology with sequences of known heavy chain origin 
and little homology with sequences of light chain 
origin when compared with the sequences available 
in Sequences of Proteins of Immunological Interest 
by Kabat et al., 4th ed., U.S. Dept. of Health and 
Human Services, (1987). This demonstrated that the 
library was enriched for the desired V H sequence in 
preference to other sequences, such as light chain 
sequences. 

The diversity of the population was 
assessed by classifying the sequenced clones into 
predefined subgroups (Figure 5). Mouse V H sequences 
are classified into eleven subgroups 
[I (A,B), II (A,B,C), III (A,B,C,D), V 
(A,B)] based on framework amino acid sequences 
described in Sequences of Proteins of Immunological 
Interest by Kabat et al., 4th ed., U.S. Dept. of 
Health and Human Services, (1987); Dildrop, 
Immunology Today, 5:84, (1984); and Brodeur et al., 
Eur. J. Immunol., 14:922, (1984). Classification 
of the sequenced clones demonstrated that the cDNA 
library contained V H sequences of at least 7 
different subgroups. Further, a pairwise 
comparison of the homology between the sequenced 
clones showed that no two sequences were identical 
at all positions, suggesting that the population is 
diverse to the extent that it is possible to 
characterize by sequence analysis. 

Six of the clones (L 36-50, Figure 5) 
belong to the subclass III B and had very similar 
nucleotide sequences. This may reflect a 
preponderance of mRNA derived from one or several 
related variable genes in stimulated spleen, but 
the data do not permit ruling out the possibility 
of a bias in the amplification process. 
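
The pairwise comparison described in this example, which checks that no two sequenced clones are identical at all positions, can be sketched as follows. The three short sequences are made-up placeholders and not data from Figure 5. The sketch also assumes the sequences are already aligned and of equal length, which is a simplification of a real homology comparison.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percentage of positions at which two pre-aligned sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def pairwise_identities(clones):
    """Percent identity for every unordered pair of named clones."""
    return {
        (name1, name2): percent_identity(seq1, seq2)
        for (name1, seq1), (name2, seq2) in combinations(clones.items(), 2)
    }


# Hypothetical 10-base placeholders; the real clones were read over 120 bases.
clones = {
    "cloneA": "ATGGCCAGGT",
    "cloneB": "ATGGCCAGCT",
    "cloneC": "ATGACCTGGT",
}

identities = pairwise_identities(clones)
all_distinct = all(value < 100.0 for value in identities.values())
for pair, value in identities.items():
    print(pair, f"{value:.0f}%")
print("no two clones identical:", all_distinct)
```

A cluster of closely related clones, such as the six subclass III B clones noted above, would show up in such a table as pairs with identities close to, but below, 100%.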

9. V H -Expression Vector Construction 
The main criterion used in choosing 
a vector system was the necessity of generating the 
largest number of Fab fragments which could be 
screened directly. Bacteriophage lambda was 
selected as the expression vector for three 
reasons. First, in vitro packaging of phage DNA is 
the most efficient method of reintroducing DNA into 
host cells. Second, it is possible to detect 
protein expression at the level of single phage 
plaques. Finally, the screening of phage libraries 
typically involves less difficulty with nonspecific 
binding. The alternative, plasmid cloning vectors, 
is only advantageous in the analysis of clones 
after they have been identified. This advantage is 
not lost in the present system because of the use 
of Lambda Zap, thereby permitting a plasmid 
containing the heavy chain, light chain, or Fab 
expressing inserts to be excised. 

To express the plurality of V H -coding DNA 
homologs in an E. coli host cell, a vector was 
constructed that placed the V H -coding DNA homologs 
in the proper reading frame, provided a ribosome 
binding site as described by Shine et al., Nature, 
254:34, 1975, provided a leader sequence directing 
the expressed protein to the periplasmic space, 
provided a polynucleotide sequence that coded for a 
known epitope (epitope tag) and also provided a 
polynucleotide that coded for a spacer protein 
between the V H -coding DNA homolog and the 
polynucleotide coding for the epitope tag. A 
synthetic DNA sequence containing all of the above 
polynucleotides and features was constructed by 
designing single stranded polynucleotide segments 
of 20-40 bases that would hybridize to each other 
and form the double stranded synthetic DNA sequence 
shown in Figure 6. The individual single-stranded 
polynucleotides (N1-N12) are shown in Table 3. 

Polynucleotides N2, N3, N9-4, N11, N10-5, N6, 
N7 and N8 were kinased by adding 1 ul of each 
polynucleotide (0.1 ug/ul) and 20 units of T4 
polynucleotide kinase to a solution containing 70 
mM Tris-HCl at pH 7.6, 10 mM MgCl2, 5 mM DTT, 10 mM 
2ME, 500 micrograms per ml of BSA. The solution 
was maintained at 37C for 30 minutes and the 
reaction stopped by maintaining the solution at 65C 
for 10 minutes. The two end polynucleotides, 20 ng 
each of polynucleotides N1 and N12, were 
added to the above kinasing reaction solution 
together with 1/10 volume of a solution containing 
20.0 mM Tris-HCl at pH 7.4, 2.0 mM MgCl2 and 50.0 mM 
NaCl. This solution was heated to 70C for 5 
minutes and allowed to cool to room temperature, 
approximately 25C, over 1.5 hours in a 500 ml 
beaker of water. During this time period all 10 
polynucleotides annealed to form the double 
stranded synthetic DNA insert shown in Figure 6A. 
The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA 
insert by adding 40 ul of the above reaction to a 
solution containing 50 mM Tris-HCl at pH 7.5, 7 mM 
MgCl2, 1 mM DTT, 1 mM adenosine triphosphate (ATP) 
and 10 units of T4 DNA ligase. This solution was 
maintained at 37C for 30 minutes and then the T4 
DNA ligase was inactivated by maintaining the 
solution at 65C for 10 minutes. The end 
polynucleotides were kinased by mixing 52 ul of the 
above reaction, 4 ul of a solution containing 10 mM 
ATP and 5 units of T4 polynucleotide kinase. This 
solution was maintained at 37C for 30 minutes and 
then the T4 polynucleotide kinase was inactivated 
by maintaining the solution at 65C for 10 minutes. 

The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector that had been 
previously digested with the restriction enzymes 
Not I and Xho I. The ligation mixture was packaged 
according to the manufacturer's instructions using 
Gigapack II Gold packaging extract available from 
Stratagene Cloning Systems, La Jolla, CA. The 
packaged ligation mixture was plated on XL1-Blue 
cells (Stratagene Cloning Systems, San Diego, CA). 
Individual Lambda Zap II plaques were cored and the 
inserts excised according to the in vivo excision 
protocol provided by the manufacturer, Stratagene 
Cloning Systems, La Jolla, CA. This in vivo 
excision protocol moves the cloned insert from the 
Lambda Zap II vector into a plasmid vector to allow 
easy manipulation and sequencing. The accuracy of 
the above cloning steps was confirmed by sequencing 
the insert using the Sanger dideoxy method 
described by Sanger et al., Proc. Natl. Acad. 
Sci. USA, 74:5463-5467, (1977) and using the 
manufacturer's instructions in the AMV Reverse 
Transcriptase 35S-dATP sequencing kit from Stratagene 
Cloning Systems, La Jolla, CA. The sequence of the 
resulting V H expression vector is shown in Figure 6A 
and Figure 7. 

Table 3 



N1) 5' GGCCGCAAATTCTATTTCAAGGAGACAGTCAT 3' 
N2) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 
N3) 5' GTTATTACTCGCTGCCCAACCAGCCATGGCCC 3' 
N4) 5' AGGTGAAACTGCTCGAGAATTCTAGACTAGGTTAATAG 3' 
N5) 5' TCGACTATTAACTAGTCTAGAATTCTCGAG 3' 
N6) 5' CAGTTTCACCTGGGCCATGGCTGGTTGGG 3' 
N7) 5' CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 
N8) 5' GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC 3' 
N9-4) 5' AGGTGAAACTGCTCGAGATTTCTAGACTAGTTACCCGTAC 3' 
N11) 5' GACGTTCCGGACTACGGTTCTTAATAGAATTCG 3' 
N12) 5' TCGACGAATTCTATTAAGAACCGTAGTC 3' 
N10-5) 5' CGGAACGTCGTACGGGTAACTAGTCTAGAAATCTCGAG 3' 
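
The synthetic insert is assembled from overlapping single-stranded polynucleotides, so each bottom-strand segment should be the reverse complement of a stretch that bridges two adjacent top-strand segments. The sketch below checks this for one junction, using the N1, N2 and N8 sequences as they appear in Table 3. Treating N1 followed by N2 as contiguous top-strand sequence is an assumption drawn from the sequences themselves, not something the text states explicitly.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(COMPLEMENT)[::-1]


# Sequences copied from Table 3 (5' to 3').
N1 = "GGCCGCAAATTCTATTTCAAGGAGACAGTCAT"
N2 = "AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT"
N8 = "GTATTTCATTATGACTGTCTCCTTGAAATAGAATTTGC"

top_strand = N1 + N2                   # assumed contiguous top strand
n8_footprint = reverse_complement(N8)  # region of the top strand that N8 pairs with
start = top_strand.find(n8_footprint)

print("N8 pairs with top strand starting at index:", start)
print("unpaired 5' overhang on N1:", top_strand[:start])
print("bases of N1 covered:", len(N1) - start)
print("bases of N2 covered:", start + len(N8) - len(N1))
```

Read this way, N8 anneals across the N1/N2 junction and leaves the first four bases of N1 (GGCC) single-stranded, which is consistent with an end designed for ligation into a Not I-cut vector.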



10. V L Expression Vector Construction 
To express the plurality of V L coding 
polynucleotides in an E. coli host cell, a vector was 
constructed that placed the V L coding polynucleotide in 
the proper reading frame, provided a ribosome binding 
site as described by Shine et al., Nature, 254:34, 
(1975), provided a leader sequence directing the 
expressed protein to the periplasmic space and also 
provided a polynucleotide that coded for a spacer protein 
between the V L polynucleotide and the polynucleotide 
coding for the epitope tag. A synthetic DNA sequence 
containing all of the above polynucleotides and features 
was constructed by designing single stranded 
polynucleotide segments of 20-40 bases that would 
hybridize to each other and form the double stranded 
synthetic DNA sequence shown in Figure 6B. The 
individual single-stranded polynucleotides (N1-N8) are 
shown in Table 3. 

Polynucleotides N2, N3, N4, N6, N7 and N8 were 
kinased by adding 1 ul of each polynucleotide and 20 
units of T4 polynucleotide kinase to a solution containing 
70 mM Tris-HCl at pH 7.6, 10 mM MgCl2, 5 mM DTT, 10 mM 
2ME, 500 micrograms per ml of BSA. The solution was 
maintained at 37C for 30 minutes and the reaction stopped 
by maintaining the solution at 65C for 10 minutes. The 
two end polynucleotides, 20 ng each of polynucleotides N1 
and N5, were added to the above kinasing 
reaction solution together with 1/10 volume of a solution 
containing 20.0 mM Tris-HCl at pH 7.4, 2.0 mM MgCl2 and 
50.0 mM NaCl. This solution was heated to 70C for 5 
minutes and allowed to cool to room temperature, 
approximately 25C, over 1.5 hours in a 500 ml beaker of 
water. During this time period all the polynucleotides 
annealed to form the double stranded synthetic DNA 
insert. The individual polynucleotides were covalently 
linked to each other to stabilize the synthetic DNA 
insert by adding 40 ul of the above reaction to a 
solution containing 50 mM Tris-HCl at pH 7.5, 7 mM MgCl2, 
1 mM DTT, 1 mM ATP and 10 units of T4 DNA ligase. This 
solution was maintained at 37C for 30 minutes and then 
the T4 DNA ligase was inactivated by maintaining the 
solution at 65C for 10 minutes. The end polynucleotides 
were kinased by mixing 52 ul of the above reaction, 4 ul 
of a solution containing 10 mM ATP and 5 units of T4 
polynucleotide kinase. This solution was maintained at 
37C for 30 minutes and then the T4 polynucleotide kinase 
was inactivated by maintaining the solution at 65C for 10 
minutes. The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector that had been 
previously digested with the restriction enzymes Not I 
and Xho I. The ligation mixture was packaged according 
to the manufacturer's instructions using Gigapack II Gold 
packaging extract available from Stratagene Cloning 
Systems, La Jolla, CA. The packaged ligation mixture was 
plated on XL1-Blue cells (Stratagene Cloning Systems, La 
Jolla, CA). Individual Lambda Zap II plaques were cored 
and the inserts excised according to the in vivo excision 
protocol provided by the manufacturer, Stratagene Cloning 
Systems, La Jolla, CA and described in Short et al., 
Nucleic Acids Res., 16:7583-7600, 1988. This in vivo 
excision protocol moves the cloned insert from the Lambda 
Zap II vector into a phagemid vector to allow easy 
manipulation and sequencing and also produces the 
phagemid version of the V L expression vectors. The 
accuracy of the above cloning steps was confirmed by 
sequencing the insert using the Sanger dideoxy method 
described by Sanger et al., Proc. Natl. Acad. Sci. USA, 
74:5463-5467, (1977) and using the manufacturer's 
instructions in the AMV reverse transcriptase 35S-dATP 
sequencing kit from Stratagene Cloning Systems, La Jolla, 
CA. The sequence of the resulting V L expression vector is 
shown in Figure 6 and Figure 8. 

The V L expression vector used to construct the 
V L library was the phagemid produced to allow the DNA of 
the V L expression vector to be determined. The phagemid 
was produced, as detailed above, by the in vivo excision 
process from the Lambda Zap V L expression vector (Figure 
8). The phagemid version of this vector was used because 
the Nco I restriction enzyme site is unique in this 
version and thus could be used to operatively link the 
V L DNA homologs into the expression vector. 

11. V L II-Expression Vector Construction 
To express the plurality of V L -coding DNA 

homologs in an E. coli host cell, a vector was 
constructed that placed the V L -coding DNA homologs in the 
proper reading frame, provided a ribosome binding site as 
described by Shine et al., Nature, 254:34, 1975, provided 
the Pel B gene leader sequence that has been previously 
used to successfully secrete Fab fragments in E. coli by 
Lei et al., J. Bac., 169:4379 (1987) and Better et al., 
Science, 240:1041 (1988), and also provided a 
polynucleotide containing a restriction endonuclease site 
for cloning. A synthetic DNA sequence containing all of 
the above polynucleotides and features was constructed by 
designing single stranded polynucleotide segments of 
20-60 bases that would hybridize to each other and form the 
double stranded synthetic DNA sequence shown in Figure 
10. The sequence of each of the individual single-stranded 
polynucleotides (01-08) within the double stranded 
synthetic DNA sequence is shown in Table 4. 

Polynucleotides 02, 03, 04, 05, 06 and 07 were 
kinased by adding 1 ul (0.1 ug/ul) of each polynucleotide 
and 20 units of T4 polynucleotide kinase to a solution 
containing 70 mM Tris-HCl at pH 7.6, 10 mM magnesium 
chloride (MgCl2), 5 mM dithiothreitol (DTT), 10 mM 
2-mercaptoethanol (2ME), 500 micrograms per ml of bovine 
serum albumin. The solution was maintained at 37C for 30 
minutes and the reaction stopped by maintaining the 
solution at 65C for 10 minutes. Twenty ng each of the 
two end polynucleotides, 01 and 08, were added to the 
above kinasing reaction solution together with 1/10 
volume of a solution containing 20.0 mM Tris-HCl at pH 
7.4, 2.0 mM MgCl2 and 15.0 mM sodium chloride (NaCl). 
This solution was heated to 70C for 5 minutes and allowed 
to cool to room temperature, approximately 25C, over 1.5 
hours in a 500 ml beaker of water. During this time 
period all 8 polynucleotides annealed to form the double 
stranded synthetic DNA insert shown in Figure 9. The 
individual polynucleotides were covalently linked to each 
other to stabilize the synthetic DNA insert by adding 40 
ul of the above reaction to a solution containing 50 mM 
Tris-HCl at pH 7.5, 7 mM MgCl2, 1 mM DTT, 1 mM ATP and 10 
units of T4 DNA ligase. This solution was maintained at 
37C for 30 minutes and then the T4 DNA ligase was 
inactivated by maintaining the solution at 65C for 10 
minutes. The end polynucleotides were kinased by mixing 
52 ul of the above reaction, 4 ul of a solution 
containing 10 mM ATP and 5 units of T4 polynucleotide 
kinase. This solution was maintained at 37C for 30 
minutes and then the T4 polynucleotide kinase was 
inactivated by maintaining the solution at 65C for 10 
minutes. The completed synthetic DNA insert was ligated 
directly into a Lambda Zap II vector that had been 
previously digested with the restriction enzymes Not I 
and Xho I. The ligation mixture was packaged according 
to the manufacturer's instructions using Gigapack II Gold 
packaging extract available from Stratagene Cloning 
Systems, La Jolla, CA. The packaged ligation mixture was 
plated on XL1-Blue cells (Stratagene Cloning Systems, San 
Diego, CA). Individual Lambda Zap II plaques were cored 
and the inserts excised according to the in vivo excision 
protocol provided by the manufacturer, Stratagene Cloning 
Systems, La Jolla, CA. This in vivo excision protocol 
moves the cloned insert from the Lambda Zap II vector 
into a plasmid vector to allow easy manipulation and 
sequencing. The accuracy of the above cloning steps was 
confirmed by sequencing the insert using the 
manufacturer's instructions in the AMV Reverse 
Transcriptase 35S-dATP sequencing kit from Stratagene 
Cloning Systems, La Jolla, CA. The sequence of the 
resulting V L II-expression vector is shown in Figure 9 and 
Figure 11. 

TABLE 4 

01) 5' TGAATTCTAAACTAGTCGCCAAGGAGACAGTCAT 3' 
02) 5' AATGAAATACCTATTGCCTACGGCAGCCGCTGGATT 3' 
03) 5' GTTATTACTCGCTGCCCAACCAGCCATGGCC 3' 
04) 5' GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG 3' 
05) 5' GTATTTCATTATGACTGTCTCCTTGGCGACTAGTTTAGAATTCAAGCT 3' 
06) 5' CAGCGAGTAATAACAATCCAGCGGCTGCCGTAGGCAATAG 3' 
07) 5' TGACGAGCTCGGCCATGGCTGGTTGGG 3' 
08) 5' TCGACGGCCGCTTAACTCTAGAAC 3' 
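
The cloning steps in these examples depend on particular restriction sites being present in the synthetic inserts. As a rough illustration, the sketch below scans two of the Table 4 polynucleotides (the entries labelled 03 and 04) for the standard recognition sequences of the enzymes named in the examples. It searches only the strand as written and does not model cut positions, methylation or sites spanning two polynucleotides. The dictionary and function names are illustrative.

```python
# Standard recognition sequences (5' to 3') for enzymes named in the examples.
SITES = {
    "EcoR I": "GAATTC",
    "Nco I": "CCATGG",
    "Not I": "GCGGCCGC",
    "Sac I": "GAGCTC",
    "Spe I": "ACTAGT",
    "Xba I": "TCTAGA",
    "Xho I": "CTCGAG",
}


def find_sites(seq):
    """Return {enzyme: [0-based start positions]} for sites present in seq."""
    hits = {}
    for enzyme, site in SITES.items():
        positions = [
            i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)
        ]
        if positions:
            hits[enzyme] = positions
    return hits


# Sequences copied from Table 4, entries 03 and 04.
O3 = "GTTATTACTCGCTGCCCAACCAGCCATGGCC"
O4 = "GAGCTCGTCAGTTCTAGAGTTAAGCGGCCG"

print("03:", find_sites(O3))
print("04:", find_sites(O4))
```

The scan finds an Nco I site near the 3' end of polynucleotide 03 and both Sac I and Xba I sites in polynucleotide 04, the latter pair being the enzymes used to digest the V L homologs in Example 13.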

12. V H + V L Library Construction 

To prepare an expression library enriched in V H 
sequences, DNA homologs enriched in V H sequences were 
prepared according to Example 6 using the same set of 5' 
primers but with primer 12A (Table 1) as the 3' primer. 
These homologs were then digested with the restriction 
enzymes Xho I and Spe I and purified on a 1% agarose gel 
using the standard electroelution technique described in 
Molecular Cloning: A Laboratory Manual, Maniatis et al., 
eds., Cold Spring Harbor, NY, (1982). These prepared V H 
DNA homologs were then directly inserted into the V H 
expression vector that had been previously digested with 
Xho I and Spe I. 

The ligation mixture containing the V H DNA
homologs was packaged according to the manufacturer's
specifications using Gigapack II Gold packaging extract
(Stratagene Cloning Systems, La Jolla, CA). The
expression libraries were then ready to be plated on
XL1-Blue cells.

To prepare a library enriched in V L sequences,
PCR amplified products enriched in V L sequences were
prepared according to Example 6. These V L DNA homologs
were digested with restriction enzymes Nco I and Spe I.
The digested V L DNA homologs were purified on a 1% agarose
gel using standard electroelution techniques described in
Molecular Cloning: A Laboratory Manual, Maniatis et al.,
eds., Cold Spring Harbor, NY (1982). The prepared V L DNA
homologs were directly inserted into the V L expression
vector that had been previously digested with the
restriction enzymes Nco I and Spe I. The ligation
mixture containing the V L DNA homologs was transformed
into XL1-Blue competent cells using the manufacturer's
instructions (Stratagene Cloning Systems, La Jolla, CA).

13. Inserting V L Coding DNA Homologs
Into V L Expression Vector
In preparation for cloning a library enriched
in V L sequences, PCR amplified products (2.5 ug/30 ul of
150 mM NaCl, 8 mM Tris-HCl (pH 7.5), 6 mM MgSO4, 1 mM DTT,
200 ug/ml BSA) at 37C were digested with restriction
enzymes Sac I (125 units) and Xba I (125 units) and
purified on a 1% agarose gel. In cloning experiments
which required a mixture of the products of the
amplification reactions, equal volumes (50 ul, 1-10 ug
concentration) of each reaction mixture were combined
after amplification but before restriction digestion.
After gel electrophoresis of the digested PCR amplified
spleen mRNA, the region of the gel containing DNA
fragments of approximately 350 bps was excised,
electroeluted into a dialysis membrane, ethanol
precipitated and resuspended in a TE solution containing
10 mM Tris-HCl pH 7.5 and 1 mM EDTA to a final
concentration of 50 ng/ul.

The V L II-expression DNA vector was prepared for
cloning by admixing 100 ug of this DNA to a solution
containing 250 units each of the restriction
endonucleases Sac I and Xba I (both from Boehringer
Mannheim, Indianapolis, IN) and a buffer recommended by
the manufacturer. This solution was maintained at 37C
for 1.5 hours. The solution was heated at 65C for 15
minutes to inactivate the restriction endonucleases.
The solution was chilled to 30C and 25 units of
heat-killable (HK) phosphatase (Epicenter, Madison, WI) and
CaCl2 were admixed to it according to the manufacturer's
specifications. This solution was maintained at 30C for
1 hour. The DNA was purified by extracting the solution
with a mixture of phenol and chloroform followed by
ethanol precipitation. The V L II expression vector was now
ready for ligation to the V L DNA homologs prepared in the
above examples.

DNA homologs enriched in V L sequences were
prepared according to Example 5 but using a 5' light
chain primer and the 3' light chain primer shown in Table
2. Individual amplification reactions were carried out
using each 5' light chain primer in combination with the
3' light chain primer. These separate V L homolog
containing reaction mixtures were mixed and digested with
the restriction endonucleases Sac I and Xba I according
to Example 6. The V L homologs were purified on a 1%
agarose gel using the standard electroelution technique
described in Molecular Cloning: A Laboratory Manual,
Maniatis et al., eds., Cold Spring Harbor, NY, (1982).
These prepared V L DNA homologs were then directly inserted
into the Sac I - Xba I cleaved V L II-expression vector that
was prepared above by ligating 3 moles of V L DNA homolog
inserts with each mole of the V L II-expression vector
overnight at 5C. 3.0 x 10^5 plaque forming units were
obtained after packaging the DNA with Gigapack II Gold
(Stratagene Cloning Systems, La Jolla, CA) and 50% were
recombinants.
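
The 3:1 molar ratio of insert to vector used in the ligation above translates into a mass of insert that depends on the relative lengths of the two DNAs. The following minimal sketch shows the arithmetic. Only the 3:1 ratio is taken from the text; the vector mass, the vector length and the insert length are hypothetical placeholders, and the function name is illustrative.

```python
# Sketch: mass of insert needed for a chosen insert:vector molar ratio.
# The 3:1 ratio comes from the text; the fragment lengths and vector mass
# below are hypothetical placeholders, not values from the source.

def insert_mass_ng(vector_ng: float, vector_bp: int, insert_bp: int,
                   molar_ratio: float = 3.0) -> float:
    """Mass of insert (ng) giving `molar_ratio` moles of insert per mole
    of vector.

    For double-stranded DNA, mass is proportional to length, so the
    number of moles scales as mass / length and the molar ratio reduces
    to a simple length scaling.
    """
    return vector_ng * (insert_bp / vector_bp) * molar_ratio


# Hypothetical example: 1 ug of a 40,000 bp vector and a 350 bp insert.
print(insert_mass_ng(vector_ng=1000.0, vector_bp=40_000, insert_bp=350))
```

Because the insert is far shorter than a lambda vector, a molar excess of insert still corresponds to a small fraction of the vector mass.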

14. Randomly Combining V H and V L DNA
Homologs on the Same Expression
Vector

The V L II-expression library prepared in Example
13 was amplified and 500 ug of V L II-expression library
phage DNA was prepared from the amplified phage stock using
the procedures described in Molecular Cloning: A
Laboratory Manual, Maniatis et al., eds., Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY (1982). 50 ug
of this V L II-expression library phage DNA was maintained
in a solution containing 100 units of Mlu I restriction
endonuclease (Boehringer Mannheim, Indianapolis, IN) in
200 ul of a buffer supplied by the endonuclease
manufacturer for 1.5 hours at 37C. The solution was then
extracted with a mixture of phenol and chloroform. The
DNA was then ethanol precipitated and resuspended in 100
ul of water. This solution was admixed with 100 units of
the restriction endonuclease EcoR I (Boehringer Mannheim,
Indianapolis, IN) in a final volume of 200 ul of buffer
containing the components specified by the manufacturer.
This solution was maintained at 37C for 1.5 hours and the
solution was then extracted with a mixture of phenol and
chloroform. The DNA was ethanol precipitated and the DNA
resuspended in TE.

The V H expression library prepared in Example 12
was amplified and 500 ug of V H expression library phage
DNA was prepared using the methods detailed above. 50 ug of
the V H expression library phage DNA was maintained in a
solution containing 100 units of Hind III restriction
endonuclease (Boehringer Mannheim, Indianapolis, IN) in
200 ul of a buffer supplied by the endonuclease
manufacturer for 1.5 hours at 37C. The solution was then
extracted with a mixture of phenol and chloroform
saturated with 0.1 M Tris-HCl at pH 7.5. The DNA was
then ethanol precipitated and resuspended in 100 ul of
water. This solution was admixed with 100 units of the
restriction endonuclease EcoR I (Boehringer Mannheim,
Indianapolis, IN) in a final volume of 200 ul of buffer
containing the components specified by the manufacturer.
This solution was maintained at 37C for 1.5 hours and the
solution was then extracted with a mixture of phenol and
chloroform. The DNA was ethanol precipitated and the DNA
resuspended in TE.

The restriction digested V H and V L II-expression
libraries were ligated together. The ligation reaction
consisted of 1 ug of V H and 1 ug of V L II phage library DNA
in a 10 ul reaction using the reagents supplied in a
ligation kit purchased from Stratagene Cloning Systems
(La Jolla, California). After ligation for 16 hr at 4C,
1 ul of the ligated phage DNA was packaged with
Gigapack II Gold packaging extract and plated on
XL1-Blue cells prepared according to the manufacturer's
instructions. A portion of the 3 x 10^6 clones obtained was
used to determine the effectiveness of the combination.
The resulting V H and V L expression vector is shown in
Figure 11.

Clones containing both V H and V L were excised
from the phage to pBluescript using the in vitro excision
protocol described by Short et al., Nucleic Acids
Research, 16:7583-7600 (1988). Clones chosen for
excision expressed the decapeptide tag and did not cleave
X-gal in the presence of 2 mM IPTG, thus remaining white.
Clones with these characteristics represented 30% of the
library. 50% of the clones chosen for excision contained
a V H and V L as determined by restriction analysis. Since
approximately 30% of the clones in the V H library
expressed the decapeptide tag and 50% of the clones in
the V L II library contained a V L sequence, it was
anticipated that no more than 15% of the clones in the
combined library would contain both V H and V L clones. The
actual number obtained was 15% of the library, indicating
that the process of combination was very efficient.
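
The 15% upper bound quoted above follows from multiplying the two library fractions. A minimal sketch of that reasoning is given here, assuming that heavy and light chain arms combine independently during ligation; the function name is illustrative and the percentages are the ones quoted in the text.

```python
# Sketch: expected fraction of combined clones carrying both chains,
# assuming the two libraries combine independently (an assumption that
# mirrors the reasoning in the text, not a separate measurement).

def expected_double_fraction(frac_heavy: float, frac_light: float) -> float:
    """Fraction of combined clones expected to carry both a functional
    heavy chain insert and a light chain insert."""
    return frac_heavy * frac_light


expected = expected_double_fraction(0.30, 0.50)  # values quoted in the text
observed = 0.15                                   # value quoted in the text
print(f"expected {expected:.0%}, observed {observed:.0%}")
```

An observed fraction equal to the product of the two input fractions is what would be seen if essentially every ligation event joined one arm from each library.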

15. Segregating DNA Homologs For a V H
Antigen Binding Protein
To segregate the individual clones containing
DNA homologs that code for a V H antigen binding protein,
the titre of the V H expression library prepared according
to Example 11 was determined. This library titration was
performed using methods well known to one skilled in the
art. Briefly, serial dilutions of the library were made
into a buffer containing 100 mM NaCl, 50 mM Tris-HCl at
pH 7.5 and 10 mM MgSO4. Ten ul of each dilution was added
to 200 ul of exponentially growing E. coli cells and
maintained at 37C for 15 minutes to allow the phage to
adsorb to the bacterial cells. Three ml of top agar
consisting of 5 g/L NaCl, 2 g/L of MgSO4, 5 g/L yeast
extract, 10 g/L NZ amine (casein hydrolysate) and 0.7%
melted, 50C agarose was then added. The phage, the
bacteria and the top agar were mixed and then evenly
distributed across the surface of a prewarmed bacterial
agar plate (5 g/L NaCl, 2 g/L MgSO4, 5 g/L yeast extract,
10 g/L NZ amine (casein hydrolysate) and 15 g/L Difco
agar). The plates were maintained at 37C for 12 to 24
hours during which time the lambda plaques developed on
the bacterial lawn. The lambda plaques were counted to
determine the total number of plaque forming units per ml
in the original library.
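
The titre calculation implied by the counting step above can be sketched as follows. The 10 ul plated volume is taken from the text; the plaque count and the dilution factor in the example are hypothetical, and the function name is illustrative.

```python
# Sketch: plaque-forming units per ml from one countable plate.
# The 10 ul plated volume is from the text; the plaque count and the
# dilution factor below are hypothetical.

def titer_pfu_per_ml(plaques: int, dilution: float,
                     plated_ml: float = 0.010) -> float:
    """pfu/ml in the undiluted stock.

    `dilution` is the fraction of the original stock present in the
    plated sample, e.g. 1e-4 for a 10,000-fold dilution.
    """
    return plaques / (dilution * plated_ml)


print(f"{titer_pfu_per_ml(plaques=150, dilution=1e-4):.2e} pfu/ml")
```

In practice only plates with a countable number of well separated plaques would be used for this estimate.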
The titred expression library was then plated
out so that replica filters could be made from the
library. The replica filters are used later to
segregate out the individual clones in the library that
are expressing the antigen binding proteins of interest.
Briefly, a volume of the titred library that would yield
20,000 plaques per 150 millimeter plate was added to 600
ul of exponentially growing E. coli cells and maintained
at 37C for 15 minutes to allow the phage to adsorb to the
bacterial cells. Then 7.5 ml of top agar was admixed to
the solution containing the bacterial cells and the
adsorbed phage and the entire mixture distributed evenly
across the surface of a prewarmed bacterial agar plate.
This process was repeated for a sufficient number of
plates to plate out a total number of plaques at least
equal to the library size. These plates were then
maintained at 37C for 5 hours. The plates were then
overlaid with nitrocellulose filters that had been
pretreated with a solution containing 10 mM
isopropyl-beta-D-thiogalactopyranoside (IPTG) and maintained
at 37C for 4 hours. The orientation of the nitrocellulose
filters in relation to the plate was marked by punching
a hole with a needle dipped in waterproof ink through the
filter and into the bacterial plates at several
locations. The nitrocellulose filters were removed with
forceps and washed once in a TBST solution containing 20
mM Tris-HCl at pH 7.5, 150 mM NaCl and 0.05%
polyoxyethylene sorbitan monolaurate (Tween-20). A second
nitrocellulose filter that had also been soaked in a
solution containing 10 mM IPTG was reapplied to the
bacterial plates to produce duplicate filters. The
filters were further washed in a fresh solution of TBST
for 15 minutes. Filters were then placed in a blocking
solution consisting of 20 mM Tris-HCl at pH 7.5, 150 mM
NaCl and 1% BSA and agitated for 1 hour at room
temperature. The nitrocellulose filters were transferred
to a fresh blocking solution containing a 1 to 500
dilution of the primary antibody and gently agitated for
at least 1 hour at room temperature. After the filters
were agitated in the solution containing the primary
antibody the filters were washed 3 to 5 times in TBST for
5 minutes each time to remove any of the residual unbound
primary antibody. The filters were transferred into a
solution containing fresh blocking solution and a 1 to
500 to 1 to 1,000 dilution of alkaline phosphatase
conjugated secondary antibody. The filters were gently
agitated in the solution for at least 1 hour at room
temperature. The filters were washed 3 to 5 times in a
solution of TBST for at least 5 minutes each time to
remove any residual unbound secondary antibody. The
filters were washed once in a solution containing 20 mM
Tris-HCl at pH 7.5 and 150 mM NaCl. The filters were
removed from this solution and the excess moisture
blotted from them with filter paper. The color was
developed by placing the filter in a solution containing
100 mM Tris-HCl at pH 9.5, 100 mM NaCl, 5 mM MgCl2, 0.3
mg/ml of nitro Blue Tetrazolium (NBT) and 0.15 mg/ml of
5-bromo-4-chloro-3-indolyl-phosphate (BCIP) for at least
30 minutes at room temperature. The residual color
development solution was rinsed from the filter with a
solution containing 20 mM Tris-HCl at pH 7.5 and 150 mM
NaCl. The filter was then placed in a stop solution
consisting of 20 mM Tris-HCl at pH 2.3 and 1 mM EDTA.
The development of an intense purple color indicates a
positive result. The filters are used to locate the
phage plaque that produced the desired protein. That
phage plaque is segregated and then grown up for further
analysis.
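
The plating step described above requires enough plates to cover the whole library at 20,000 plaques per plate. A minimal sketch of that bookkeeping follows. The plaque density per plate is taken from the text; the library size and stock titre used in the example are hypothetical, and the function names are illustrative.

```python
# Sketch: how many 150 mm plates, and how much titred stock per plate,
# are needed to plate at least one library equivalent.
# 20,000 plaques per plate is from the text; library size and titre
# below are hypothetical.

PLAQUES_PER_PLATE = 20_000


def plates_needed(library_size: int, per_plate: int = PLAQUES_PER_PLATE) -> int:
    """Smallest number of plates whose total plaque count is at least
    `library_size`."""
    return -(-library_size // per_plate)  # ceiling division on integers


def volume_per_plate_ul(titer_pfu_per_ml: float,
                        per_plate: int = PLAQUES_PER_PLATE) -> float:
    """Microliters of stock carrying `per_plate` plaque-forming units."""
    return per_plate / titer_pfu_per_ml * 1000.0


print(plates_needed(1_000_000))
print(volume_per_plate_ul(1.0e7))
```

Plating more than one library equivalent would raise the chance that any given clone is represented on the filters, at the cost of more plates to screen.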

Several different combinations of primary
antibodies and secondary antibodies were used. The first
combination used a primary antibody immunospecific for a
decapeptide that will be expressed only if the V H antigen
binding protein is expressed in the proper reading frame
to allow read through translation to include the
decapeptide epitope covalently attached to the V H antigen
binding protein. This decapeptide epitope and an
antibody immunospecific for this decapeptide epitope were
described by Green et al., Cell 28:477 (1982) and Niman
et al., Proc. Natl. Acad. Sci. U.S.A. 80:4949 (1983). The
sequence of the decapeptide recognized is shown in Figure
11. A functional equivalent of the monoclonal antibody
that is immunospecific for the decapeptide can be
prepared according to the methods of Green et al. and
Niman et al. The secondary antibody used with this
primary antibody was a goat anti-mouse IgG (Fisher
Scientific). This antibody was immunospecific for the
constant region of mouse IgG and did not recognize any
portion of the variable region of heavy chain. This
particular combination of primary and secondary
antibodies when used according to the above protocol
determined that between 25% and 30% of the clones were
expressing the decapeptide and therefore these clones
were assumed to also be expressing a V H antigen binding
protein.

In another combination the anti-decapeptide
mouse monoclonal was used as the primary antibody and an
affinity purified goat anti-mouse Ig, commercially
available as part of the picoBlue immunoscreening kit
from Stratagene Cloning Systems, La Jolla, CA, was used as
the secondary antibody. This combination resulted in a
large number of false positive clones because the
secondary antibody also immunoreacted with the V H of the
heavy chain. Therefore this antibody reacted with all
clones expressing any V H protein and this combination of
primary and secondary antibodies did not specifically
detect clones with the V H polynucleotide in the proper
reading frame and thus allowing expression of the
decapeptide.

Several combinations of primary and secondary
antibodies are used where the primary antibody is
conjugated to fluorescein isothiocyanate (FITC) and thus
the immunospecificity of the antibody is not important
because the antibody is conjugated to the preselected
antigen (FITC) and it is that antigen that should be
bound by the V H antigen binding proteins produced by the
clones in the expression library. After this primary
antibody has bound by virtue of its FITC, it is detected
with a secondary antibody. The first combination of this
type uses a primary antibody that is FITC conjugated
mouse monoclonal antibody p2 5764 (ATCC #HB-9505). The
secondary antibody used with this primary antibody is a
goat anti-mouse IgG (Fisher Scientific, Pittsburgh, PA)
conjugated to alkaline phosphatase using the method
described in Antibodies: A Laboratory Manual, Harlow and
Lane, eds., Cold Spring Harbor, NY, (1988). If a
particular clone in the V H expression library expresses
a V H binding protein that binds the FITC covalently
coupled to the primary antibody, the secondary antibody
binds specifically and when developed the alkaline
phosphatase causes a distinct purple color to form.

The second combination of antibodies of this
type uses a primary antibody that is FITC conjugated
rabbit anti-human IgG (Fisher Scientific, Pittsburgh, PA).
The secondary antibody used with this primary antibody is
a goat anti-rabbit IgG conjugated to alkaline phosphatase
using the methods described in Antibodies: A Laboratory
Manual, Harlow and Lane, eds., Cold Spring Harbor, NY,
(1988). If a particular clone in the V H expression
library expresses a V H binding protein that binds the FITC
conjugated to the primary antibody, the secondary
antibody binds specifically and when developed the
alkaline phosphatase causes a distinct purple color to
form.

Another primary antibody was the mouse
monoclonal antibody p2 5764 (ATCC # HB-9505) conjugated
to both FITC and 125I. The antibody would be bound by any
V H antigen binding proteins expressed. Then because the
antibody is also labeled with 125I, an autoradiogram of
the filter is made instead of using a secondary antibody
that is conjugated to alkaline phosphatase. This direct
production of an autoradiogram allows segregation of the
clones in the library expressing a V H antigen binding
protein of interest.

16. Segregating DNA Homologs For a
V H and V L that Form an Antigen
Binding Fv

To segregate the individual clones containing
DNA homologs that code for a V H and a V L that form an
antigen binding Fv, the V H and V L expression library was
titred according to Example 15. The titred expression
library was then screened for the presence of the
decapeptide tag expressed with the V H using the methods
described in Example 15. DNA was then prepared from the
clones found to express the decapeptide tag. This DNA was
digested with the restriction endonuclease Pvu II to
determine whether these clones also contained a V L DNA
homolog. The slower migration of a Pvu II restriction
endonuclease fragment indicated that the particular clone
contained both a V H and a V L DNA homolog.

The clones containing both a V H and a V L DNA
homolog were analyzed to determine whether these clones
produced an assembled Fv protein molecule from the V H and
V L DNA homologs.

The Fv protein fragment produced in clones
containing both V H and V L was visualized by immune
precipitation of radiolabeled protein expressed in the
clones. A 50 ml culture of LB broth (5 g/L yeast
extract, 10 g/L tryptone and 10 g/L NaCl at pH 7.0)
containing 100 ug/ml of ampicillin was inoculated with E.
coli harboring a plasmid containing a V H and a V L. The
culture was maintained at 37C with shaking until the
optical density measured at 550 nm was 0.5. The culture
was then centrifuged at 3,000 g for 10 minutes and
resuspended in 50 ml of M9 media (6 g/L Na2HPO4, 3 g/L
KH2PO4, 0.5 g/L NaCl, 1 g/L NH4Cl, 2 g/L glucose, 2 mM
MgSO4 and 0.1 mM CaCl2) supplemented with amino acids
without methionine or cysteine. This solution was
maintained at 37C for 5 minutes and then 0.5 mCi of 35S as
HSO4- (New England Nuclear, Boston, MA) was added and the
solution was further maintained at 37C for an additional
2 hours. The solution was then centrifuged at 3,000 xg and
the supernatant discarded. The resulting bacterial cell
pellet was frozen and thawed and then resuspended in a
solution containing 40 mM Tris pH 8.0, 100 mM sucrose and
1 mM EDTA. The solution was centrifuged at 10,000 xg for
10 minutes and the resulting pellet discarded. The
supernatant was admixed with 10 ul of anti-decapeptide
monoclonal antibody and maintained for 30-90 minutes on
ice. 40 ul of protein G coupled to sepharose beads
(Pharmacia, Piscataway, NJ) was admixed to the solution
and the resulting solution maintained for 30 minutes on ice
to allow an immune precipitate to form. The solution was
centrifuged at 10,000 xg for 10 minutes and the resulting
pellet was resuspended in 1 ml of a solution containing
100 mM Tris-HCl at pH 7.5 and centrifuged at 10,000 xg
for 10 minutes. This procedure was repeated twice. The
resulting immune precipitate pellet was loaded onto a
PhastGel Homogenous 20 gel (Pharmacia, Piscataway, NJ)
according to the manufacturer's directions. The gel was
dried and used to expose X-ray film.

The resulting autoradiogram is shown in Figure
12. The presence of assembled Fv molecules can be seen by
the presence of V L that was immunoprecipitated because it
was attached to the V H-decapeptide tag recognized by the
precipitating antibody.



17. Generation of a Large Combinatorial
Library of the Immunoglobulin Repertoire
in Phage

Vectors suitable for expression of V H, V L, Fv
and Fab sequences are diagrammed in Figures 7 and 9. As
previously discussed, the vectors were constructed by
modification of Lambda Zap by inserting synthetic
oligonucleotides into the multiple cloning site. The
vectors were designed to be antisymmetric with respect to
the Not I and EcoR I restriction sites which flank the
cloning and expression sequences. As described below,
this antisymmetry in the placement of restriction sites
in a linear vector like bacteriophage is the essential
feature of the system which allows a library expressing
light chains to be combined with one expressing heavy
chains to construct combinatorial Fab expression
libraries. Lambda Zap II V L II (Figure 9) is designed to
serve as a cloning vector for light chain fragments and
Lambda Zap II V H (Figure 7) is designed to serve as a
cloning vector for heavy chain sequences in the initial
step of library construction. These vectors are
engineered to efficiently clone the products of PCR
amplification with specific restriction sites
incorporated at each end.

A. PCR Amplification of Antibody Fragments

The PCR amplification of mRNA isolated from
spleen cells with oligonucleotides which incorporate
restriction sites into the ends of the amplified product
can be used to clone and express heavy chain sequences
including Fd and kappa chain sequences. The
oligonucleotide primers used for these amplifications are
presented in Tables 1 and 2. The primers are analogous
to those which have been successfully used in Example 5
for amplification of V H sequences. The set of 5' primers
for heavy chain amplification was identical to those
previously used to amplify V H and those for light chain
amplification were chosen on similar principles, Sastry
et al., Proc. Natl. Acad. Sci. USA, 86:5728 (1989) and
Orlandi et al., Proc. Natl. Acad. Sci. USA, 86:3833 (1989).
The unique 3' primers of heavy (IgG1) and light (k) chain
sequences were chosen to include the cysteines involved
in heavy-light chain disulfide bond formation. At this
stage no primer was constructed to amplify lambda light
chains since they constitute only a small fraction of
murine antibodies. In addition, Fv fragments have been
constructed using a 3' primer which is complementary to
the mRNA in the J (joining) region (amino acid 128) and a
set of unique 5' primers which are complementary to the
first strand cDNA in the conserved N-terminal region of
the processed protein. Restriction endonuclease
recognition sequences are incorporated into the primers
to allow for the cloning of the amplified fragment into a
lambda phage vector in a predetermined reading frame for
expression.

B. Library Construction

The construction of a combinatorial library was
accomplished in two steps. In the first step, separate
heavy and light chain libraries were constructed in
Lambda Zap II V H and Lambda Zap II V L II respectively. In
the second step, these two libraries were combined at the
antisymmetric EcoR I sites present in each vector. This
resulted in a library of clones each of which potentially
co-expresses a heavy and a light chain. The actual
combinations are random and do not necessarily reflect
the combinations present in the B-cell population in the
parent animal. Lambda Zap II V H expression vector has
been used to create a library of heavy chain sequences
from DNA obtained by PCR amplification of mRNA isolated
from the spleen of a 129 GIX+ mouse previously immunized
with p-nitrophenyl phosphoramidate (NPN) antigen 1
according to formula I (Figure 13) conjugated to keyhole
limpet hemocyanin (KLH). The NPN-KLH conjugate was
prepared by admixture of 250 ul of a solution containing
2.5 mg of NPN according to formula I (Figure 13) in
dimethylformamide with 750 ul of a solution containing 2
mg of KLH in 0.01 M sodium phosphate buffer (pH 7.2).
The two solutions were admixed by slow addition of the
NPN solution to the KLH solution while the KLH solution
was being agitated by a rotating stirring bar.
Thereafter the admixture was maintained at 4C for 1 hour
with the same agitation to allow conjugation to proceed.
The conjugated NPN-KLH was isolated from the
nonconjugated NPN and KLH by gel filtration through
Sephadex G-25. The isolated NPN-KLH conjugate was used
in mouse immunizations as described in Example 2.
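
The concentrations present during the conjugation reaction follow directly from the masses and volumes given above. The following minimal sketch shows the arithmetic, assuming that the two volumes are simply additive on mixing; the variable names are illustrative.

```python
# Sketch: final concentrations after admixing the NPN and KLH solutions.
# Masses and volumes are from the text; additive volumes are assumed.

npn_mg, npn_ul = 2.5, 250.0   # NPN in dimethylformamide
klh_mg, klh_ul = 2.0, 750.0   # KLH in sodium phosphate buffer

total_ml = (npn_ul + klh_ul) / 1000.0
npn_mg_per_ml = npn_mg / total_ml
klh_mg_per_ml = klh_mg / total_ml
dmf_fraction = npn_ul / (npn_ul + klh_ul)   # volume fraction of solvent

print(npn_mg_per_ml, klh_mg_per_ml, dmf_fraction)
```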

The spleen mRNA resulting from the above
immunizations was isolated and used to create a primary
library of V H gene sequences using the Lambda Zap II V H
expression vector. The primary library contains 1.3 x 10^6
pfu and has been screened for the expression of the
decapeptide tag to determine the percentage of clones
expressing Fd sequences. The sequence for this peptide
is only in frame for expression following the cloning of
a Fd (or V H) fragment into the vector. At least 80% of
the clones in the library express Fd fragments based on
immuno-detection of the decapeptide tag.

The light chain library was constructed in the
same way as the heavy chain and shown to contain 2.5 x 10^6
members. Plaque screening, using an anti-kappa chain
antibody, indicated that 60% of the library contained
expressed light chain inserts. This relatively small
percentage of inserts probably resulted from incomplete
dephosphorylation of vector after cleavage with Sac I and
Xba I.

Once obtained, the two libraries were used to
construct a combinatorial library by crossing them at the
EcoR I site. To accomplish the cross, DNA was first
purified from each library. The light chain library was
cleaved with Mlu I restriction endonuclease, the resulting
5' ends dephosphorylated and the product digested with
EcoR I. This process cleaved the left arm of the vector
into several pieces but the right arm, containing the
light chain sequences, remained intact. In a parallel
fashion, the DNA of the heavy chain library was cleaved with
Hind III, dephosphorylated and cleaved with EcoR I,
destroying the right arm but leaving the left arm
containing the heavy chain sequences intact. The DNAs
so prepared were then combined and ligated. After
ligation only clones which resulted from combination of a
right arm of light chain-containing clones and a left arm
of heavy chain-containing clones reconstituted a viable
phage. After ligation and packaging, 2.5 x 10^7 clones
were obtained. This is the combinatorial Fab expression
library that was screened to identify clones having
affinity for NPN. To determine the frequency of the phage
clones which co-express the light and heavy chain
fragments, duplicate lifts of the light chain, heavy
chain and combinatorial libraries were screened as above
for light and heavy chain expression. In this study of
approximately 500 recombinant phage approximately 60%
co-expressed light and heavy chain proteins.

C. Antigen Binding

All three libraries, the light chain, the heavy
chain and Fab, were screened to determine if they
contained recombinant phage that expressed antibody
fragments binding NPN. In a typical procedure 30,000
phage were plated and duplicate lifts with nitrocellulose
screened for binding to NPN coupled to 125I labeled BSA
(Figure 15). Duplicate screens of 80,000 recombinant
phage from the light chain library and a similar number
from the heavy chain library did not identify any clones
which bound the antigen. In contrast, the screen of a
similar number of clones from the Fab expression library
identified many phage plaques that bound NPN (Figure 15).
This observation indicates that under conditions where
many heavy chains in combination with light chains bind
to antigen the same heavy or light chains alone do not.
Therefore, in the case of NPN, it is believed that there
are many heavy and light chains that only bind antigen
when they are combined with specific light and heavy
chains respectively.

To assess the ability to screen large numbers
of clones and obtain a more quantitative estimate of the
frequency of antigen binding clones in the combinatorial
library, one million phage plaques were screened and
approximately 100 clones which bound to antigen were
identified. For six clones which were believed to bind
NPN, a region of the plate containing the positive and
approximately 20 surrounding bacteriophage plaques was
"cored", replated, and screened with duplicate lifts
(Figure 15). As expected, approximately one in twenty of
the phage specifically bind to antigen. "Cores" of
regions of the plated phage believed to be negative did
not give positives on replating.

To determine the specificity of the antigen-
antibody interaction, antigen binding was competed with
free unlabeled antigen as shown in Figure 16.
Competition studies showed that individual clones could
be distinguished on the basis of antigen affinity. The
concentration of free haptens required for complete
inhibition of binding varied between 10-100 x 10^-9 M,
suggesting that the expressed Fab fragments had binding
constants in the nanomolar range.
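
The screening figures reported above can be turned into rough frequencies. The following minimal sketch computes the frequency of antigen binding clones in the primary screen, the frequency after one round of coring and replating, and the resulting enrichment. The counts are the approximate figures quoted in the text, treated here as exact for illustration only; the function name is illustrative.

```python
# Sketch: rough binder frequency and enrichment from the screening numbers.
# Counts are the approximate figures quoted in the text; treating them as
# exact is a simplifying assumption.

def frequency(positives: int, screened: int) -> float:
    """Fraction of screened plaques that scored positive."""
    return positives / screened


primary = frequency(100, 1_000_000)   # ~100 binders per million plaques
after_coring = frequency(1, 20)       # ~1 binder per ~20 cored plaques
enrichment = after_coring / primary

print(f"primary screen: {primary:.1e}")
print(f"after coring:   {after_coring:.2f}")
print(f"enrichment:     {enrichment:.0f}-fold")
```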

D. Composition of the Clones and Their
Expressed Products
In preparation for characterization of the
protein products able to bind NPN as described in Example
18C, a plasmid containing the heavy and light chain genes
was excised from the appropriate "cored" bacteriophage
plaque using M13mp8 helper phage. Mapping of the excised
plasmid demonstrated a restriction pattern consistent
with incorporation of heavy and light chain sequences.
The protein products of one of the clones were analyzed by
ELISA and Western blotting to establish the composition
of the NPN binding protein. A bacterial supernate
following IPTG induction was concentrated and subjected
to gel filtration. Fractions in the molecular weight
range 40-60 kD were pooled, concentrated and subjected to
a further gel filtration separation. As illustrated in
Figure 17, ELISA analysis of the eluting fractions
demonstrated that NPN binding was associated with a
protein of molecular weight about 50 kD which
immunological detection showed contained both heavy and
light chains. A Western blot (not shown) of a
concentrated bacterial supernate preparation under
non-reducing conditions was developed with anti-decapeptide
antibody. This revealed a protein band of molecular
weight of 50 kD. Taken together these results are
consistent with NPN binding being a function of Fab
fragments in which heavy and light chains are covalently
linked.

E. Comparison of the Properties of the In Vivo Repertoire
Versus the Phage Combinatorial Library
In this example a relatively restricted library was
prepared because only a limited number of primers were
used for PCR amplification of Fd sequences. The library
is expected to contain only clones expressing
kappa/gamma1 sequences. However, this is not an inherent
limitation of the method since additional primers can be
added to amplify any antibody class or subclass. Despite
this restriction we were able to isolate a large number
of antigen binding clones.

A central issue arising from this work is how a 
phage library prepared as described herein compares with 
the in vivo antibody repertoire in terms of size, 
characteristics of diversity, and ease of access. 

The size of the mammalian antibody repertoire is
difficult to judge but a figure of the order of 10^6-10^8
different antigen specificities is often quoted. With
some of the reservations discussed below, a phage library
of this size or larger can readily be constructed by a
modification of the current method. In fact, once an
initial combinatorial library has been constructed, heavy
and light chains can be shuffled to obtain libraries of
exceptionally large numbers.
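
Because each heavy chain can in principle be paired with
each light chain, the number of possible combinations is
the product of the sizes of the two chain pools. The
following is a minimal sketch of that arithmetic. The pool
sizes are hypothetical values chosen only to reach the
10^6-10^8 range quoted above; the text does not state the
sizes of the individual heavy and light chain pools here.

```python
def combinatorial_library_size(n_heavy: int, n_light: int) -> int:
    """Number of distinct heavy/light pairings from two chain pools."""
    if n_heavy < 0 or n_light < 0:
        raise ValueError("pool sizes must be non-negative")
    return n_heavy * n_light


# Hypothetical pool sizes, for illustration only.
example_pools = [(1_000, 1_000), (10_000, 10_000)]

for n_heavy, n_light in example_pools:
    size = combinatorial_library_size(n_heavy, n_light)
    print(f"{n_heavy} heavy x {n_light} light -> {size} combinations")
```

The count is an upper bound on diversity: it assumes every
pairing is represented and says nothing about how many
pairings bind antigen.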

In principle, the diversity characteristics of the naive
(unimmunized) in vivo repertoire and corresponding phage
library are expected to be similar in that both involve a
random combination of heavy and light chains. However,
different factors will act to restrict the diversity
expressed by an in vivo repertoire and phage library. For
example, a physiological modification such as tolerance
will restrict the expression of certain antigenic
specificities from the in vivo repertoire but these
specificities may still appear in the phage library. On
the other hand, bias in the cloning process may introduce
restrictions into the diversity of the phage library. For
example, the representation of mRNA for sequences
expressed by stimulated B-cells can be expected to
predominate over those of unstimulated cells because of
higher levels of expression. Different source tissues
(e.g., peripheral blood, bone marrow or regional lymph
nodes) and different PCR primers (e.g., ones expected to
amplify different antibody classes) may result in
libraries with different diversity characteristics.
Another difference between the in vivo repertoire and the
phage library is that antibodies isolated from the former
may have benefited from affinity maturation due to
somatic mutations after combination of heavy and light
chains whereas the latter randomly combines the matured
heavy and light chains. Given a large enough phage
library derived from a particular in vivo repertoire, the
original matured heavy and light chains will be
recombined. However, since one of the potential benefits
of this new technology is to obviate the need for
immunization by the generation of a single highly diverse
"generic" phage library, it would be useful to have
methods to optimize sequences to compensate for the
absence of somatic mutation and clonal selection. Three
procedures are made readily available through the methods
of the present invention. First, saturation mutagenesis
may be performed on the CDRs and the resulting Fabs can
be assayed for increased function. Second, a heavy or a
light chain of a clone which binds antigen can be
recombined with the entire light or heavy chain libraries
respectively in a procedure identical to the one used to
construct the combinatorial library. Third, iterative
cycles of the two above procedures can be performed to
further optimize the affinity or catalytic properties of
the immunoglobulin. It should be noted that the latter
two procedures are not permitted in B-cell clonal
selection, which suggests that the methods described here
may actually increase the ability to identify optimal
sequences.
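
The second procedure above, recombining one chain of a
known binder with the entire library of partner chains,
can be sketched as a pairing operation. This is only an
illustration of the bookkeeping, not of the cloning steps;
the chain labels and function names are hypothetical.

```python
from itertools import product


def combinatorial_library(heavy_chains, light_chains):
    """All heavy/light pairings, as in the initial combinatorial library."""
    return list(product(heavy_chains, light_chains))


def recombine_with_library(fixed_chain, partner_library, fixed_is_heavy=True):
    """Pair one chain of a known binder with every chain of the partner library."""
    if fixed_is_heavy:
        return [(fixed_chain, light) for light in partner_library]
    return [(heavy, fixed_chain) for heavy in partner_library]


# Hypothetical chain labels standing in for cloned sequences.
heavy_pool = ["H1", "H2", "H3"]
light_pool = ["L1", "L2", "L3", "L4"]

full_library = combinatorial_library(heavy_pool, light_pool)
reshuffled = recombine_with_library("H2", light_pool)

print(len(full_library), "pairings in the full library")
print(len(reshuffled), "pairings when H2 is held fixed")
```

Holding one chain fixed restricts the search to pairings
that share that chain, so the sublibrary scales with the
size of the partner pool alone.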

Access is the third area where it is of interest to
compare the in vivo antibody repertoire and phage
library. In practical terms the phage library is much
easier to access. The screening methods allow one to
survey at least 50,000 clones per plate so that 10^6
antibodies can be readily examined in a day. This factor
alone should encourage the replacement of hybridoma
technology with the methods described here. The most
powerful screening methods utilize selection, which may be
accomplished by incorporating selectable markers into the
antigen such as leaving groups necessary for replication
of auxotrophic bacterial strains or toxic substituents
susceptible to catalytic inactivation. There are also
further advantages related to the fact that the in vivo
antibody repertoire can only be accessed via immunization,
which is a selection on the basis of binding affinity.
The phage library is not similarly restricted. For
example, the only general method to identify antibodies
with catalytic properties has been by pre-selection on
the basis of affinity of the antibody to a transition
state analogue. No such restrictions apply to the in
vitro library where catalysis can, in principle, be
assayed directly. The ability to directly assay large
numbers of antibodies for function may allow selection
for catalysts in reactions where a mechanism is not well
defined or synthesis of the transition state analog is
difficult. Assaying for catalysis directly eliminates
the bias of the screening procedure for reaction
mechanisms pejorative to a synthetic analog and therefore
simultaneous exploration of multiple reaction pathways
for a given chemical transformation is possible.
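
The throughput figure quoted above (at least 50,000 clones
per plate, 10^6 antibodies in a day) corresponds to a
small number of plates. A minimal sketch of that
arithmetic follows; the function name is an assumption
made for illustration, and the per-plate figure is the
lower bound given in the text.

```python
def plates_required(clones_to_screen: int, clones_per_plate: int = 50_000) -> int:
    """Smallest number of plates that holds the requested number of clones."""
    if clones_to_screen < 0 or clones_per_plate <= 0:
        raise ValueError("counts must be non-negative and plate capacity positive")
    # Integer ceiling division avoids floating point rounding.
    return -(-clones_to_screen // clones_per_plate)


plates_for_one_million = plates_required(10**6)
print(f"{plates_for_one_million} plates cover 10^6 clones at 50,000 clones per plate")
```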

The methods disclosed herein describe generation of Fab
fragments which are clearly different in a number of
important respects from intact (whole) antibodies. There
is undoubtedly a loss of affinity in having monovalent
Fab antigen binders but this can be compensated by
selection of suitably tight binders. For a number of
applications such as diagnostics and biosensors it may be
preferable to have monovalent Fab fragments. For
applications requiring Fc effector functions, the
technology already exists for extending the heavy chain
gene and expressing the glycosylated whole antibody in
mammalian cells.

The ideas presented here address the bottleneck in the
identification and evaluation of antibodies. It is now
possible to construct and screen at least three orders of
magnitude more clones with monospecificity than
previously possible. The potential applications of the
method should span basic research and applied sciences.

The foregoing is intended as illustrative of the present
invention but not limiting. Numerous variations and
modifications can be effected without departing from the
true spirit and scope of the invention.

What Is Claimed Is:

1. A method of producing a linear double stranded DNA
expression vector containing a 5' terminal promoter
controlling the expression of first and second
polypeptides from respective first and second genes, said
first and second polypeptides being capable of forming a
heterodimeric receptor of predetermined specificity,
which method comprises:

(a) isolating a first diverse population of first genes
containing a gene coding for said first polypeptide, said
first population being constituted by linear double
stranded DNA molecules defining a restriction site
recognized by an endonuclease, said restriction site
located 5' terminal to said genes but 3' terminal to a
promoter controlling the expression of said genes;

(b) isolating a second diverse population of second genes
containing a gene coding for said second polypeptide,
said second population being constituted by linear double
stranded DNA molecules defining said restriction site,
said restriction site located 3' terminal to said second
genes, said second genes located 3' terminal to a
promoter controlling the expression of said second genes;

(c) cleaving said double stranded DNA molecules while
present in said populations with said endonuclease to
produce restriction fragments having a cohesive terminus
and either one of said first genes or one of said second
genes; and

(d) randomly ligating said restriction fragments one to
another via said cohesive terminus of each to produce a
diverse population of double stranded linear DNA
expression vectors having one of said first genes and one
of said second genes in tandem on the same DNA strand and
under the control of a single promoter;

(e) isolating from said diverse population of double
stranded DNA expression vectors a vector capable of
expressing said heterodimeric receptor of predetermined
specificity.

2. The method of Claim 1 wherein said first genes are
first polypeptide-coding DNA homologs and said isolating
comprises:

(a) separating the strands of a repertoire of first
polypeptide-coding genes, said repertoire comprising
double-stranded nucleic acids each containing a first
polypeptide-coding strand annealed to a complementary
strand;

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
first polypeptide-coding strands, and each of said second
primers having a nucleotide sequence capable of
hybridizing to a sequence conserved among said
complementary strands, said first and second primers
being capable of priming the amplification of a plurality
of different first polypeptide-coding DNA homologs from
said repertoire of first polypeptide-coding genes, said
treating producing a plurality of different first
polypeptide-coding DNA homologs; and

(c) operatively linking to an expression vector for
expression under the control of a promoter each of a
plurality of said different first polypeptide-coding DNA
homologs.

3. The method of Claim 1 wherein said second genes are
second polypeptide-coding DNA homologs and said isolating
comprises:

(a) separating the strands of a repertoire of second
polypeptide-coding genes, said repertoire comprising
double-stranded nucleic acids each containing a second
polypeptide-coding strand annealed to a complementary
strand;

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
second polypeptide-coding strands, and each of said
second primers having a nucleotide sequence capable of
hybridizing to a sequence conserved among said
complementary strands, said first and second primers
being capable of priming the amplification of a plurality
of different second polypeptide-coding DNA homologs from
said repertoire of second polypeptide-coding genes, said
treating producing a plurality of different second
polypeptide-coding DNA homologs; and

(c) operatively linking to an expression vector for
expression under the control of a promoter each of a
plurality of said different second polypeptide-coding DNA
homologs.

4. The method of claim 1 wherein said first polypeptide
is a V H, said first genes are V H-coding DNA homologs,
and said isolating comprises:

(a) separating the strands of a repertoire of V H-coding
genes, said repertoire comprising double-stranded nucleic
acids each containing a V H-coding strand annealed to a
complementary strand; and

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
V H-coding strands, and each of said second primers
having a nucleotide sequence capable of hybridizing to a
sequence conserved among said complementary strands, said
first and second primers being capable of priming the
amplification of a plurality of different V H-coding DNA
homologs from said repertoire of V H-coding genes, said
treating producing a plurality of different V H-coding
DNA homologs.

5. The method of claim 1 wherein said second polypeptide
is a V L, said second genes are V L-coding DNA homologs,
and said isolating comprises:

(a) separating the strands of a repertoire of V L-coding
genes, said repertoire comprising double-stranded nucleic
acids each containing a V L-coding strand annealed to a
complementary strand; and

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
V L-coding strands, and each of said second primers
having a nucleotide sequence capable of hybridizing to a
sequence conserved among said complementary strands, said
first and second primers being capable of priming the
amplification of a plurality of different V L-coding DNA
homologs from said repertoire of V L-coding genes, said
treating producing a plurality of different first
polypeptide-coding DNA homologs.

6. The method of claim 1 wherein said first polypeptide
is a V L, said first genes are V L-coding DNA homologs,
and said isolating comprises:

(a) separating the strands of a repertoire of V L-coding
genes, said repertoire comprising double-stranded nucleic
acids each containing a V L-coding strand annealed to a
complementary strand; and

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
V L-coding strands, and each of said second primers
having a nucleotide sequence capable of hybridizing to a
sequence conserved among said complementary strands, said
first and second primers being capable of priming the
amplification of a plurality of different V L-coding DNA
homologs from said repertoire of V L-coding genes, said
treating producing a plurality of different first
polypeptide-coding DNA homologs.

7. The method of claim 1 wherein said second polypeptide
is a V H, said second genes are V H-coding DNA homologs,
and said isolating comprises:

(a) separating the strands of a repertoire of V H-coding
genes, said repertoire comprising double-stranded nucleic
acids each containing a V H-coding strand annealed to a
complementary strand; and

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
V H-coding strands, and each of said second primers
having a nucleotide sequence capable of hybridizing to a
sequence conserved among said complementary strands, said
first and second primers being capable of priming the
amplification of a plurality of different V H-coding DNA
homologs from said repertoire of V H-coding genes, said
treating producing a plurality of different V H-coding
DNA homologs.

8. The method of claim 4 wherein said second polypeptide
is a V L, said second genes are V L-coding DNA homologs,
and said isolating comprises:

(a) separating the strands of a repertoire of V L-coding
genes, said repertoire comprising double-stranded nucleic
acids each containing a V L-coding strand annealed to a
complementary strand; and

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
V L-coding strands, and each of said second primers
having a nucleotide sequence capable of hybridizing to a
sequence conserved among said complementary strands, said
first and second primers being capable of priming the
amplification of a plurality of different V L-coding DNA
homologs from said repertoire of V L-coding genes, said
treating producing a plurality of different first
polypeptide-coding DNA homologs.

9. The method of claim 6 wherein said second polypeptide
is a V H, said second genes are V H-coding DNA homologs,
and said isolating comprises:

(a) separating the strands of a repertoire of V H-coding
genes, said repertoire comprising double-stranded nucleic
acids each containing a V H-coding strand annealed to a
complementary strand; and

(b) treating said separated strands, under conditions
suitable for polymerase chain reaction amplification,
with first and second polynucleotide synthesis primers,
each of said first primers having a nucleotide sequence
capable of hybridizing to a sequence conserved among said
V H-coding strands, and each of said second primers
having a nucleotide sequence capable of hybridizing to a
sequence conserved among said complementary strands, said
first and second primers being capable of priming the
amplification of a plurality of different V H-coding DNA
homologs from said repertoire of V H-coding genes, said
treating producing a plurality of different V H-coding
DNA homologs.

10. A cloning system comprising first and second linear
double stranded DNA vectors admixed in an aqueous medium
together with an endonuclease, each of said vectors
defining a promoter, a restriction site cleavable by said
endonuclease, and a polylinker, said restriction site
being located between said promoter and said polylinker
on said first vector, said polylinker being located
between said promoter and said restriction site on said
second vector, said restriction site being cleavable to
produce from each of said vectors respective first and
second polylinker-containing restriction fragments having
termini complementary to one another.

FIG. 5-1

Subclass I (A) 
#L39 

CTCGAGTCAGGACCTGGCCTCGTGAAACCTTCTCAGTCTCTGTCTCTC 

ACCTG^CTGTCA^GGOIAOICCATCAC CAGTGCTTATTACTGGKAC 
TGGATCCGGCAGTT 



Subclass II (A) 
#L11 

CTCGAGTCTGGGCCTnAACTGGCAAAACCTGGGGCCTCAGTGAAGATG 
TCCTGCAAGGCTTCTGGCCACACCTTGAC TAGTTACTGGATACACT GG 
GTAAAAnAGAGGCC 



#L03 

CTCGAGTCTGGACCTnAGCTGGTAAAGCCTGGGGTTCAGTGAAGATGT 
CCTGCAAGGCTTCTGGATACACATTCACri AGCTATGTTATACACT GGG 
TGAAGCAGAAGCCT 

FIG. 5-2 

#L32 

CTCGAGTCTGGACCTGAACTGGTAAAGCCTGGGACTTCAGTGAAGATG 
TCCTGCAAGGCTTCTGGATACACATTCAC CAGCTATGTTATGCGC TGG 
GTGAAGCAGAAGCC 



Subclass II (B) 

#L37 

CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTTCAGTGAAGTTG 
TCCTGCAAGGCTTCTGGCTACACCTTCACl vAGCTACTATATGTACT GG 
GTGAAGCAGAGGCC 



#L06 

CT CG AGT CTGGGG CTAAG CTGGTAAGG CCTGGAG CTTnAGTnAAGCTG 
TeeTGnAGGGCTTCTGGCTACTCCTTCACll AGCTACTGGATGAACT GG 
GTGAAGCAGAGGCC

FIG. 5-3



Subclass II (C) 
#L33 

CTCGAGTCTGGGGCTGAGCTGGTGAGGCCTGGAGCTTCAGTnAAGCTG 
TCCTGCAAGGCCTCTCGTACTCCTTCAC CAGCTCCTGATAACT GGGTG 
AA6CA6AGGCCTGG 

Subclass III (B) 
#L36

CTCGAGTCAGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAAACTC 
TCCTGTGCAGCCTCAGGATTCGATTTnAGn AGATACTGGATGAAT TGG 
GTCCGGCAGCTCCA 



#L02 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAATCTC 
CCgrGTGCAGCCTCAGGATTCGATTTnAGn AGATAATGGATGAGT TGG 
GTTCGGCAGGCTCC 

FIG. 5-4 

#L31 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTGAAAGTC 
TCCTGTGCAGCCTCAGGATTCGATTTRAGR AGATACTGGATGAGTT Gfl 
GTCCGGCAGCTCCA 



#L34 

CTCGAGTCTGGAGGTGGCCTGGTGCAGCCTGGAGGATCCCTCAAACTC 
TCuiGTGCAGcCTCAGGATTCGATTTnAG nAGATACTGGATGAGT Tf:rt 
GTCCGGCAGCTCCA ~ 



#L50 

CTCGAGTCAGGAGGTGGCCTGGTGCAGCCTGGAGGAGCCCTGAAACTC 
TCCTGTGCAGCCTGAGGATTCGATTTrvAGriAGATACTGGATGAGTTGG 
GTCCGCAGCTCCAG

FIG. 5-5

Subclass III (C) 
#L10 

CTCGAGTCTGGGGGAGGCTTAGTrCAGCCTGGAGGGTCCCGGAAACTC 
TgCTGTGCAGCCTgTGGATTCACTTTnAG rxAGTTTTGGAATGCACT GG 
ATTCGTCAGGCTCC 

#L08 

CTCGAGTCTGGGGGAGGCTTAGTnnAGCCTGGAGGGTCCCGGAAACTC 
TCCTGTGCAGCCTCTGGATTCACZTTTnAG nAGCTTTGGAATGCACT GG 
GTTACGTCAGGCTC 



Subclass V (A) 
#L3S 

CTCGAGTCAGGGGCTGAACTGGTGAGGCCTGGGCGTTCAGTnAAGATG 
TCCTGCAAGGCTTCAGGCTATTCCTTCAC CAGCTACTGGATGCACT GG 
GTGAAACAGAGGCC 

FIG. 5-6 

Miscellaneous 
#47 

CTCGAGTCAGGGGCTGAACTGGCAAAACCTGGGGCCTaAGTAAAGATG 
TCCTGCAAGGCTTCTGGGTACACCTCTTCTTCCTTCTGGCTGCACTGG 
ATAAAAGAAGGCCT 

#L35 

CTCGAGTCTGGACCTnAGCTGGTGAAGCCTGGGGTTCAGTTAAAATAT 

CCxGCAAGGCTTCTGGTTACTCATTTTCTnTCTACTTTGTGAACT^ 

TGATGCAGAGCCAT 



#L48 

CTCGAGTCAGGGGCTGAACTGGTGAAGCCTGGGGTTCAGTAAGTTGTC 
CTGAAGGCTTCTGGGTAGACCTTCACGGGCTACTATATGTACTGGGTG 
AAGCAGAGGCCTGG

FIG. 6A-1

V H EXPRESSION VECTOR:

SHINE— DALGARNO MET 

GGCCGCAAATTCTATTTGAAGGAGACAGTCATAATG' 
CGTTTAAGATAAAGTTCCTCTGTCAGTATTAC 



LEADER SEQUENCE 

AAATACCTATTGCCTACGGCAGCCGCT 

TTTATGGATAACGGATGCCGTCGGCGA 



GGATTGTTATTACTCGCTGCCCAACCAG' 

CCTAACAATAATGAGCGACGGGTTGGTC 



FIG. 6A-2 



LINKER 



LINKER 



NCOI 


V H BACKBONE 


XHOI SPEI


CCATGGCCC 


IAGGTGAAACTG' 


STCGAGXTTTCTAGACTAGT 



GGTACCGGGTCCACTTTGACGAGCTCTAAAGATCTGATCA 



STOP LINKER 

TyrProTyrAspValProAspTyrAlaSer

TACCCGTACGACGTTCCGGACTACGGTTCfTAATAGAATTCG' 
ATGGGCATGCTGCAAGGCCTGATGCCAAGAATTATCTTAAGCAGCT

FIG. 6B-1

V L EXPRESSION VECTOR:



SHINE -DALGARNO 

GGCCGCAAATTCTATTTdAAGGAGACAGTCATA* 
CGTTTAAGATAAAGTTCCTCTGTCAGTAT 



MET LEADER SEQUENCE 

ATGAAATACCTATTGCCTACGGCAGCCGCTGGA 
TACTTTATGGATAACGGATGCCGTCGGCGACCT 



LEADER SEQUENCE 



TTGTTATTACTCGCTGCCCAACCAG' 
AACAATAATGAGCGACGGGTTGGTC 



FIG. 6B-2 

LINKER 



NCOI V H BACKBONE 

CCATGGCCCAGGTGAAACTG' 
GGTACCGGGTCCACTTTGAC 



LINKER 



XHOI SPEI STOP

'CTCGAGAATTCTAGACTAGTTAATAG' 
GAGCTCTTAAGATCTGATCAATTATCAGCT

FIG. 12



FIG. 16

[Plot legend: [inhibitor] at 0 and at a series of concentrations of the form 6.7 x 10^-n; exponents not legible]